OM nucleic - nucleic search, using sw model



Run on: December 12, 2003, 09:18:36 ; Search time 246.613 Seconds
(without alignments)
7347.063 Million cell updates/sec



Title: US-09-931-157-1

Perfect score: 4105

Sequence: 1 gaattcgcggccgcctcttg...taaaagtttacagaaacctt 4105



Scoring table: IDENTITY_NUC

Gapop 10.0 , Gapext 1.0 
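The "sw model" named in the header is Smith-Waterman local alignment, here with an identity scoring table and affine gap penalties. The following is a minimal sketch of that kind of scoring, not the search engine's actual implementation. The function name `sw_score` is hypothetical, the match value of +1 is assumed from the perfect score equalling the query length, the mismatch value of -0.6 is an assumption, and the convention that a gap of k positions costs `gapop + k * gapext` is also an assumption.

```python
def sw_score(query, target, match=1.0, mismatch=-0.6, gapop=10.0, gapext=1.0):
    """Best local alignment score (Smith-Waterman with affine gaps, Gotoh recurrences).

    match/mismatch values and the gap-cost convention are assumptions;
    only Gapop 10.0 and Gapext 1.0 come from the report header.
    """
    neg_inf = float("-inf")
    n, m = len(query), len(target)
    # H: best score ending at (i, j); E/F: best score ending in a gap.
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Extend an existing gap, or open a new one from H.
            E[i][j] = max(E[i][j - 1] - gapext, H[i][j - 1] - gapop - gapext)
            F[i][j] = max(F[i - 1][j] - gapext, H[i - 1][j] - gapop - gapext)
            s = match if query[i - 1] == target[j - 1] else mismatch
            # Local alignment: scores never drop below zero.
            H[i][j] = max(0.0, H[i - 1][j - 1] + s, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


print(sw_score("GATTACA", "GATTACA"))  # identical sequences score their length
```

This quadratic-memory version is only suitable for short sequences; it is meant to show the recurrences, not to reproduce a search over a full database.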



Searched: 569978 seqs, 220691566 residues

Total number of hits satisfying chosen parameters: 1139956
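The header figures can be cross-checked against one another. The sketch below assumes that one "cell update" is one query residue compared with one database residue, that the query length equals the perfect score of 4105, and that both strands were searched. The two-strand factor is an assumption, suggested by the hit count being exactly twice the number of database sequences.

```python
# Values copied from the report header.
query_length = 4105        # assumed equal to the perfect score
db_residues = 220_691_566
db_seqs = 569_978
total_hits = 1_139_956
search_seconds = 246.613
strands = 2                # assumption: forward and reverse-complement strands

cells = query_length * db_residues * strands
mcups = cells / search_seconds / 1e6   # million cell updates per second

print(total_hits == strands * db_seqs)  # True
print(round(mcups))                     # 7347
```

Under these assumptions the computed rate agrees with the reported 7347.063 million cell updates/sec to within the rounding of the search time.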



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries



Database : Issued_Patents_NA:*
1: /cgn2_6/ptodata/2/ina/5A_COMB.seq:*
2: /cgn2_6/ptodata/2/ina/5B_COMB.seq:*
3: /cgn2_6/ptodata/2/ina/6A_COMB.seq:*
4: /cgn2_6/ptodata/2/ina/6B_COMB.seq:*
5: /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:*
6: /cgn2_6/ptodata/2/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS



RESULT 1 
US-08-121-446-1 

; Sequence 1, Application US/08121446 
; Patent No. 6313276 
; GENERAL INFORMATION: 

APPLICANT: IMURA, HIROO 

APPLICANT: NAKAO, KAZUWA 

APPLICANT: NAKANISHI, SHIGETADA 

TITLE OF INVENTION: A HUMAN ENDOTHELIN RECEPTOR 
NUMBER OF SEQUENCES: 4 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: MORRISON & FOERSTER 

STREET: 755 Page Mill Road 

CITY: Palo Alto

STATE: California 

COUNTRY: USA

ZIP: 94304-1018 
COMPUTER READABLE FORM: 



MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/121,446 

FILING DATE: 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/911,684 

FILING DATE: 10-JUL-1992 
ATTORNEY/AGENT INFORMATION:

NAME: CIOTTI, THOMAS E. 

REGISTRATION NUMBER: 21,013 

REFERENCE/DOCKET NUMBER: 29900-20324.00
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 813-5600 

TELEFAX: (415) 494-0792 

TELEX: 706141 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 4105 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
FEATURE:

NAME/KEY: CDS

LOCATION: 485..1768
FEATURE:

NAME/KEY: mat_peptide

LOCATION: 545 
US-08-121-446-1 

Query Match 100.0%; Score 4105; DB 4; Length 4105; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
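Each result carries three semicolon-delimited summary lines like the ones above. A small parsing sketch, assuming every field has the form "name value" with an optional trailing percent sign; the function name `parse_stats` is hypothetical and not part of any search tool:

```python
import re

# A plain or scientific-notation number, e.g. 4105, 406.2, 1.8e-95.
_NUM = r"[-+]?\d+(?:\.\d+)?(?:[eE][-+]?\d+)?"
_FIELD = re.compile(rf"\s*(.+?)\s+({_NUM})%?\s*$")


def parse_stats(lines):
    """Collect 'name value;' fields from summary lines into a dict of floats."""
    stats = {}
    for line in lines:
        for field in line.split(";"):
            m = _FIELD.match(field)
            if m:  # empty trailing fields simply do not match
                stats[m.group(1)] = float(m.group(2))
    return stats


example = [
    "Query Match 100.0%; Score 4105; DB 4; Length 4105;",
    "Best Local Similarity 100.0%; Pred. No. 0;",
    "Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0;",
]
stats = parse_stats(example)
print(stats["Score"], stats["Pred. No."], stats["Matches"])
```

Percent signs are dropped, so "Query Match" and "Best Local Similarity" come back as plain percentages.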
Qy    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Qy   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

Qy  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180

Qy  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240

Qy  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

Qy  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

Qy  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420

Qy  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

Qy  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140



Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
1921


Db 
1981



IIIIIIIMIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIII 

TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 120 0 
TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260 

Ml I :| 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 Ml 1 1 I! 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 II 1 1 

TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260 
CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320 

IIMIIIII IlilllllllllllllllllllllMIIIII jlllllllll MINN III 

CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 132 0 
TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380 

I II I IM ll II M II II IMIMMI MMM MMI Ml 

TTGTGAGATGTTGAACAGAAGGAATGG CAGCTTGAGAATTGC CCT CAGTGAACATCTTAA 1380 
GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 144 0 

M II IM I II I MM I MMIMMI II Mill M II M I 

GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 144 0 
GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 15 00 

II M I MM I II i II I IMM II II M MM l| 

GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500 
CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 

: ii ' i ! . i 1 ! 1 1 1 1 ,ii;ii , i 1 1 1 , i , , 1 1 

CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 



1 1 it 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 ) 1 1 r 1 1 1 1 1 



TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680 

MMM III II IMIMIMM M IMIMMI Mill IMM 

TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680 



I M II II II M II M IM MMI M IMMI || IM I I 



GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800 

1 1 1 1 1 1 M ! 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 

GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800 
AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 

IMIMIMIIIIIMIMIIIIIMIIIIIIIIIIMIIIMIIIIIIIMMMIIII 

AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 



MM 1 1 M I M III M 1 1 M M 1 1 MIIMI MM 1 1 1 M 1 1 1 MM 1 1 1 II M 1 1 M 1 1 

CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 
CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 

MMI II MIMM MMI MMM MMI Ml IM II Mill MM! 

CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 
TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 

IIIIIMIIIIMMIMIIIIMIMMIIMIIIIIIIIIIIIMIIIIIIIIIIIII 
1981
Db


2041 
2161
2161


Qy 
Db
Qy


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


vy 


on c\ 
z / D ± 


Db 


2761 


Qy 


2821 


Db 


2821 



1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 204 0 



ii mm ii i i ii i mm mm imm immm 



CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 

I Ml M I II !M || IMMIMMMIMI IMMM 

CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 

AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 222 0 

MM M I M I M II MMI MM MIMIMII MM 

AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 222 0 

ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 228 0 

1 1 M I 1 1 1 I 1 1 ; I ' I ! I 1 1 1 I i 1 1 ' 1 1 , . ! 1 1 M I I 

ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280 

TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 

II I M II MM II I Ml II II Mill MMII MM 

TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 

AATAGTATTCAGGTGAG CAATTAGATTAGTATTTT CCACGTCACTATTTATTTTTTTAAA 24 00 

I MM I II MM II M MM ! M II M II MM 

AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 24 00 

ACACAAATTCTAAAGCTAC^CAAATACTAC^GGCCCTTAAAGCACAGTCTGATGACAC^ 24 6 0 

1 1 1 1 1 ! i I 1 1 1 i 1 1 1 1 111! 1 1 1 1 1 1 1 1 ! 1 1 1 1 . 1 1 1 1 1 1 1 1 1 ! 1 1 ! I [ 1 1 1 1 ! 1 

ACACAAATTCTAAAGCTACAACAAATACTACA^^ 24 60 



MMMMM MM MM MM MMMMMMM MMMMMMMM 



ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 258 0 

I II Ml III MM II M MMMMM II MMMMM 

ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 258 0 



Ml IMMM MMII M IMIMMI II MMII MM 



AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700 

I MM I I Ml M II MMMIIMM II MMI IMM 

AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700 
TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760 

MM M MM I IMIMMMM II I I Ml IMIMI I 

TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760 
ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 282 0 

M MM I I MM MM J Mill MM IMMM IMM 

ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 282 0 



MMMMI MM II I MMM I, IMM I II II 



Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 



TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 294 0 

MIIIIIIIIIIIIIIIMIIIIIIIMIIIMIIIIIIIIMMIIIIIMIIIIIMI 

TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2 94 0 
GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000 

MIMM MIMIMM IMMMMMIM IMIMIIIMIIIIIIIIIIMM 

GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3 000 
TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3 060 

IIIIIIIIIMIIIIIMIIMIIIIMIIIMIIIMIIMIIMIIIIIIIIIIMM 

TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060 
GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120 

IIIIMMIIIIIIIMIIIIIIIIIMIIMIIIIIMIIIIIMIIIIIIIIIIIMI 

GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120 
AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180 

IIIIIMMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIII 

AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180 
ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 

M II M:lll III I III II I! M II III U II INN 

ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 
GGTATGTATGGATTTAATCTAATCTAATAATTGTG CCCCGCAGTTGTG C CAAAGTG CATA 3300 

M MM I! MM Ml II MM IMM MMMIMM 

GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 33 00 
GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 

1 1 1 1 1 1 1 . 1 1 1 1 1 1 1 1 III IN 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 Ml I ' 1 1 1 1 1 1 1 II I 

GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 
TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420 

IN II I IMII I il Mull II III II I >ll lllii 

TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420 
AGAGG CAG CGTGAAAG CAGATGAG CTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480 

II I! IMIMI I II MMI MIMM MM MM 

AGAGG CAG CGTGAAAG CAGATGAG CTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 34 80 
TGGTTTGATAAAG CAGTATTTGGGGTCATATTGTTTC CTGTGCTGGAG CAAAAGTCATTA 354 0 

M Ml: MM II 'I! Ill II II II III III Ml IN II II 

TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 
CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 

IM II I M II I Ml MMI Mill II MMMIMM 

CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 
TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 

IIIIIIIIIIIIMIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIMIIMMIIIIII 

TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 



MIMIIIMIMIIIIIIIIIMIIIIMIIIIIMIIIIMIIMMIIIIIIIII 



Qy 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

Qy 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

Qy 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

Qy 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960

Qy 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

Qy 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

Qy 4081 TAAAATAAAAGTTTACAGAAACCTT 4105
        |||||||||||||||||||||||||
Db 4081 TAAAATAAAAGTTTACAGAAACCTT 4105
Query Match 99.4%; Score 4079; DB 4; Length 4079;

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4079; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

T CGCGG C CGCCTCTTG CGGTCCCAGAGTGGAGTGGAAGGTCTGGAG CTTTGGGAGGAGAC 6 4 

I Mi II II I II II I II III MM Mill III II III II III II Ml II III MM II II 

TCGCGGCCGCCTCTTG CGGTCCCAGAGTGGAGTGGAAGGTCTGGAG CTTTGGGAGGAGAC 6 0 
GGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCTCGCG 124 

i ii i ui iM'iini'ir I uNiiiM ini 'i ii mi 

GGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCTCGCG 12 0 

CGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATC 184 

I M I I I M I I I I M I I I I I I I M I I I I I I II I I I I I I II I I M I I I I M I I I I I I I II II 
CGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATC 18 0 

CATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAG 244 

1 1 1 1 II M 1 1 1 1 1 1 M 1 1 1 II M 1 1 1 M 1 1 1 1 M II II M I II 1 1 1 II I II 1 1 II 1 1 M I 

CATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAG 24 0 
TGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGA 304 

i 'ii i 1 i ii 'i iririMi ii i. ii ii i iiiim m 

TGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGA 300 
GCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGC 364 

I in mm m mini mi minium mini mi Milium minimi 

GCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGC 360 

CCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAA 424 

I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAA 42 0 

AGTGAAGGTGTAAAAGCAGC^CAAGTGCAATAAGAGATATTTCCTC^WVTTTGCCTCAAG 484 

M MMI Ml II I II II I Ml III I II II I II II I 

AGTGAAGGTGTAAAAGCAG(^CAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAG 4 80 



Qy 


b 


Db 


1 


Qy 


65 


Db 


61 


Qy 


125 


Db 


121 


Qy 


185 


Db 


181 


Qy 


245 


Db 


241 


Qy 


305 


Db 


301 


Qy 


365 


Db 


361 


Qy 


425 


Db 


421 



Qy 



48 5 ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 544 



Db 


481 


Qy 


545 


Db 


541 


Qy 


605 


Db 


601 


Qy 


665 


Db 


661 


Qy 


725 


Db 


721 


Qy 


785 


Db 


781 


Qy 


845 


Db 


841 


Qy 


905 


Db 


901 


Qy 


965 


Db 


961 


Qy 


1025 


Db 


1021 


Qy 


1085 


Db 


1081 


Qy 


1145 


Db . 


1141 


Qy 


1205 


Db 


1201 


Qy 


1265 


Db 


1261 


Qy 


1325 



IMIIIIIIIIIIIIIMIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIMIIMIIII 

ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 54 0 
GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 604 

lllllll lllllllill MINI MINI Mllllllllll IMIIIIIMIIM 

GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 600 
CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 664 

II II II Ml II II II II II I MINI II II II Mill MM II II I INI II II II II II 

CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 660 
AGCAATGGCTCAATGC^CAACTATTGCCC^CAGC^GACTAAAATTACTTC^GCTTTaWV 724 

MMMMMMMMIMIMMMMMMMMMIMMMMIMIMMIMM 

AGCAATGGCTC^TGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAA 72 0 
TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 784 

1 1 1 1 1 1 i I i 1 1 1 1 1 1 I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 MMMMMIMIMI 

TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 78 0 
CTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCC 844 

M II I II 1 1 M 1 1 1 1 1 1 1 1 1 II M II 1 1 II II I II I M M I II M I II 1 1 1 II I II 1 1 1 1 

CTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCC 84 0 
AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 904 

ii ini in: ii m ii ii i ii ii hi :i in ii i 

AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 900 

CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 964 

I I I I I I I II I I II M I I II M I I II I I I I I II I I II I II I I II I II II I M I M II I M I 
CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 960 

TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 1024 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 1020 
GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 1084 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II I II II II II 1 1 1 M 1 1 M I M II 1 1 1 1 M II 1 1 1 1 1 1 1 

GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 1080 
GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 1144 

I HIM IM II u III HI I MMHI Ml I IM II M 

GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 114 0 
GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATG 1204 

M II i II II M II. IMh II IMMI II MM IM I 

GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATG 12 00 
CTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTC 1264 

I MMI IM M I Ml Ml I Mil II II I III II IM 

CTCAATGCCAC^TCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTC 1260 
GGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGT 1324 

MM II II I II MM II III MM IMMI II Mill MM lllllll MM MM II 1 1 

GGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGT 1320 
GAGATGTTGAACAGAAGGAATGG CAG CTTGAGAATTG CCCTCAGTGAACATCTTAAG CAG 1384 

II IMMI IM IM I IM IMMM I Ml Ml M MM II M 



r 



Db 


1321 


Qy 


1385 


Db 


1381 


Qy 


1445 


Db 


1441 


Qy 


1505 


Db 


1501 


Qy 


1565 


Db 


1561 


Qy 


1625 


Db 


1621 


Qy 


1685 


Db 


1681 


Qy 


1745 


Db 


1741 


Qy 


1805 


Db 


1801 


Qy 


1865 


Db 


1861 


Qy 


1925 


Db 


1921 


Qy 


1985 


Db 


1981 


Qy 


2045 


Db 


2041 


Qy 


Z 1 Ud 


Db 


2101 


Qy 


2165 


Db 


2161 



1321 GAGATGTTGAACAGAAGGAATGG CAG CTTGAGAATTG C CCTCAGTGAACATCTTAAGCAG 1380 



IMIMIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIMI 



CCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGA 15 04 

IMIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGA 15 00 

TGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATG 1564 

'! ii'Mi ii: ii ir i :n Mi M ii'M ;n !i ii i; i 

TGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATG 1560 

AATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTC 1624 

IIIMIIIIIMIIMIIIIIIIMIIMIIIIMIIIIIIIIIIIIIIIIMI IMIII 

AATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTC 162 0 

CAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATG 1684 

IIIIIMIMIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII IMIII 

CAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATG 1680 

AACGGAACAAGCATCCAGTGGAAGAACCACGATC^WU\CAACC^C^CACAGACCGGAGC 1744 

IIIMIIIIIIMIIMIIIIIIMIIIIIIIMIIIMIIIIIIIIIIIIIIIIIMII 

AACGGAACAAGCATCCAGTGGAAGAACCACG^ 174 0 

AGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATC 18 04 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 i 1 1 1 1 1 1 1 1 i 1 1 ! 1 1 1 1 fl 1 1 1 1 1 1 1 1 M I II I II I 

AGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATC 18 00 

CTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTC 1864 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

CTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTC 1860 

TTCCTTAATTOVCTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTGG 1924 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIII 

TTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTGG 192 0 



M II, M II M. II I II M II IM II II Mill Mi II 



1 1 1 1 [ 1 1 1 1 1 f I i I J 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 M 1 1 1 1 1 1 1 



GAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAG CTTT CAAGTACATGGCTAG 2104 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 M r 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M M 1 1 1 1 i 1 1 1 1 1 J 

GAAACCAGAAGGATATTTACTACTTTTG CATGAAAATAGAGCTTTCAAGTACATGG CTAG 2100 



M 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGAT 
TAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATAT 

1 1 1 r i f 1 1 r r 1 1 1 1 1 1 1 1 r 1 1 1 j 1 1 1 r 1 1 1 1 1 r 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 



Qy 


2225 


Db 


2221 


Qy 


2285 


Db 


2281 


Qy 


2345 


Db 


2341 


Qy 


2405 


Db 


2401 


Qy 


2465 


Db 


2461 


Qy 


2525 


Db 


2521 


Qy 


2585 


Db 


2581 


Qy 


2645 


Db 


2641 


Qy 


2705 


Db 


2701 


Qy 


2765 


Db 


2761 


Qy 


2825 


Db 


2821 


Qy 


2885 


Db 


2881 


Qy 


2945 


Db 


2941 


Qy 


3005 


Db 


3001 



GGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAAT 2284 

M M : i M ii ii T'li: r ini: i: ii ii ;i ii in 

GGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAAT 228 0 
AGCCTAACATGATTATTTGAACTTATTTACAC^TAGTTTGAAAAAAAAAAGACAAAAATA 2344 

M M 1 1 1 M I M 1 1 1 Ml I! Mill III II II 1 1 1 1 II II 1 1 II I III II II 1 1 M I 

AGCCTAACATGATTATTTGAACTTATTTAC^C^TAGTTTGAAAAAAAAAAGACAAAAATA 234 0 
GTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACAC 24 04 

II I II III ImIIiMMmIMI I ill IN h Mill II 

GTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACAC 24 00 
AAATTCTAAAGCTACAACAAATACTACAGGCCC 2464 

IIIIIMIIIIIMIIMIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIMI 

AAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTTG 24 60 
GCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGG 2524 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGG 252 0 
TGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCZAAAAGTAATGCTTCA 2584 

IIIIIIIIIIMIIIIIIMIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

TGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCA 258 0 
ATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAAA 2644 

I Ml M'MMII I I M Mill II MIMI IM M MMI M 

ATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAAA 264 0 



IIMIIMIMIIIIIMIIIIIIIMIIIIIMIIIMIIIIIIIIIIIMMIMI 



TGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATGA 2764 

IIIIIIIIMIIIIIIIIIIIIIIMMMIMIIIIIIIIIIIIIIMIIIIIMIIII 

TG CACTGTATATAGAAGTCTAAAACACAC CTAAGAGAAAAAGAT CGAATTTTTCAGATGA 2760 
TTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCACC 2824 

I III INI III III llllllllllllll llllllllll III MINN III MINIMI 

TTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCACC 282 0 
TCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCC 2884 

MIIIMIIIIIIIMIMIIIIIIIIIIIIIIIIMIIIIIIIMIIMIMIIIMM 

TCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCC 2880 
CTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAA 2944 

I II M ! I II M'MMII MMI! IM I M IM IMIIMI 

CTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAA 294 0 



MMM I Ml M i II II I II II M Ml II I II M,M 



TATGAACCTAACTCCC(^CCC(^(^TCTCCCTCCCACATTGTCACCATTTCAAAGGGCC 3064 

1 1 1 1 ! 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGCC 3 060 



Qy 


3065 


Db 


3061 


Qy 


3125 


Db 


3121 


Qy 


3185 


Db 


3181 


Qy 


3245 


Db 


3241 


Qy 


3305 


Db 


3301 


Qy 


3365 


Db 


3361 


Qy 


3425 


Db 


3421 


Qy 


3485 


Db 


3481 


Qy 


3545 


Db 


3541 


Qy 


3605 


Db 


3601 


Qy 


3665 


Db 


3661 


Qy 


3725 


Db 


3721 


vY 


T7Q c 

J / O J 


Db 


3781 


Qy 


3845 


Db 


3841 



OVCAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAA 3124 

IIIMIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAA 312 0 
TCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTT 3184 

MM lllllllllll IIIIIIIIIIIMIIIMIIIIIII lllllllllll Mill 

TCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTT 3180 

TTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGTA 3244 

IIMIIIIIIIMMIIIIIIIillllMIIIIIIIIIIIIIIMIIIIIIIIIIIIIII 
TTCTAGACTGTCT CTGTGGAATATATTTGTGTGTGTGATATATG CATGTGTGTGATGGTA 324 0 

TGTATGGATTTAATCTAATCTAATAATTGTG C CCCG CAGTTGTG CCAAAGTG CATAGTCT 3304 

IMIIIIIMIMIIIIIIIIIIMIIMIIIIIIIIIIIIIMMMIMIIIIIIIII 

TGTATGGATTTAATCTAAT CTAATAATTGTG C CCCG CAGTTGTG C CAAAGTG CATAGTCT 3300 
GAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTA 3364 

IMIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIMIIIIIMIIIII 

GAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTA 3360 
GCAACCTTCTG(^TT(^TAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAG 3424 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIMIIMI 

G(^CCTTCTGCATTC^TAAATCTTGTAAT(^TGTTACCATTA(^yVTGGGATATAAGA^ 342 0 
GCAGCGTGAAAG CAGATGAG CTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGT 3484 

I i ! i : 1 1 ' i ii i ' 1 1 1 1 1 ; i , i [ ' i 1 : 1 1 ' 1 1 mi i 'i i 

GCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGT 3480 
TTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACT 3544 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIMIIIIIIIIMIIIIIIIIIII 

TTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACT 3 54 0 
TTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTC 3604 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIII 

TTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTC 3600 
TGATATTTCTTTCAGACTTCG C CAGACAGATTGCTGATAATAAATTAGGTAAGATAATTT 3664 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMI 

TGATATTTCTTTCAGACTTCG C CAGACAGATTGCTGATAATAAATTAGGTAAGATAATTT 3660 
GTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAG 3724 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIMIMMIIIIIIIIIIIIII 

GTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAG 372 0 
GCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATC 3784 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIMIIIMIIIIIIIIIIIIIMI 

GCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATC 3780 



II Ml MM IIMIII 1 1 1 lllllli IIIIIIIIMIIII III MM III Mil IIMM 



TTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTC 3904 

M MMI I IM I Ml M II IMMI II MMM M II I I 

TTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTC 3 900 



Qy 



3905 AGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAAC 3 964 



Db 


3901 


i i i i i t i i i i i i i i i i i i i i > i i i i i i i i i i i i i i t i i i i i i i i i i i > i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAAC 3960 


Qy 


3965 


TGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTA 4024 

II IIMIII IIMIII III MM Ml IIIIIIIIIIMII III Mil Ml INI IIMII 

TGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTA 4 02 0 


Db 


3961 


Qy 


4025 


ACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4 083 

IllllllllllllllMllllllllllllllllllllllMIIIIIMMlllllllll 

ACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4 07 9 


Db 


4021 



RESULT 3 

US-09-175-658B-20 

Sequence 20, Application US/09175658B 
Patent No. 6372900 
GENERAL INFORMATION: 
APPLICANT: METALLINOS, DANIKA
APPLICANT: RINE, JASPER
APPLICANT: BOWLING, ANN

TITLE OF INVENTION: HORSE ENDOTHELIN-B RECEPTOR GENE AND GENE PRODUCTS
FILE REFERENCE: GOBR-110

CURRENT APPLICATION NUMBER: US/09/175,658B
CURRENT FILING DATE: 1998-10-20
PRIOR APPLICATION NUMBER: 60/062,562
PRIOR FILING DATE: 1997-10-21
NUMBER OF SEQ ID NOS: 25
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 20
LENGTH: 1321 
TYPE: DNA 
ORGANISM: Horse 
US-09-175-658B-20 

Query Match 9.9%; Score 406.2; DB 4; Length 1321; 

Best Local Similarity 67.2%; Pred. No. 1.8e-95; 

Matches 630; Conservative 0; Mismatches 283; Indels 24; Gaps 3; 
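The summary figures for this hit are mutually consistent under a simple scoring model. The sketch below assumes +1 per match and -0.6 per mismatch (values inferred from the numbers, not stated in the report), Gapop charged once per gap and Gapext once per indel position, and a perfect score of 4105 taken from the report header.

```python
# Counts and penalties copied from the report; match/mismatch values are assumptions.
matches, mismatches, indels, gaps = 630, 283, 24, 3
gapop, gapext = 10.0, 1.0
match_score, mismatch_score = 1.0, -0.6
perfect_score = 4105

score = (matches * match_score + mismatches * mismatch_score
         - gaps * gapop - indels * gapext)
# Similarity is taken over all aligned columns (matches, mismatches, indels).
similarity = 100.0 * matches / (matches + mismatches + indels)
query_match = 100.0 * score / perfect_score

print(round(score, 1), round(similarity, 1), round(query_match, 1))  # 406.2 67.2 9.9
```

The same relation gives 4079 / 4105 = 99.4% for the second result, which matches its reported Query Match.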
Qy 717 CTTTCAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGA 776 

IIMIII Mill Mill II I II M I Ml I MM M I II I 

Db 310 CTTTCAAGTACATCAACACAGTAGTGTCCTGCCTAGTGTTCGTGCT 369 

Qy 777 ATGCAACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGC 836 

I I II MMI II MINIMI IIMIII 1 1 III MM Mill II 

Db 370 ACTCCACACTGCTGAGAATCATTTACAAGAACAAGTGCATGCGGAACGGCCCTAATATCT 429

Qy 837 TGATAGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATG 896

I II I Mill II II II IMIMI I I I M II I II I IIMIII 

Db 430 TGATCGCCAGCCTGGCTCTCCGAGACCTGCTGCAAATCATCATTGACGTCCCCATCAATG 489

Qy 897 TATTTAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTT 956

I I 1 1 1 1 1 1 1 E 1 1 1 ! I I IIMII IIMII II I I 

Db 490 TCTACAAGCTGCTGGCTGAGGACTGGCC CTTTGGAGT CGAGATGT 534

Qy 957 GCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCG 1016 

I MUM I II II I MUM MM Mill Mill II II I Mill 

Db 535 GTAAGCTGGTGCCTTTCATACAGAAGGCCTCCGTGGGCATCACTGTGCTGAGTCTGTGTG 594 



Qy 1017 CTCTTAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGA 1076 

MM Ml MMMI M MM MMI MM I M M I MMM M 

Db 595 CTCTAAGTATTGACAGATATCGAGCTGTTGCTTCCTTGGAGCGAATTAAAGGAATTCGGG 654

Qy 1077 TTCCTTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCA 1136 

1 1 1 1 II II I MINIM I II III I I II I I Mill 

Db 655 TTCCAAAATGGACAGCAGTAGAAATTGTTTTAATTTGGGTGGTCTCTGTGGTTCTGGCTG 714 

Qy 1137 TTCCTGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAA 1196 

I MMMM I M M I Ml I II III II I II II II 

Db 715 TCCCTGAAGCCGTGGGTTTTGATATGATTACCGCTGACTACAAAGGAAGTTATCTGCGAA 774 
Qy 1197 CCTGTATGCTCAATGCCAC ATCAAAATTCATGGAGTTCTACCAAGATGTAAAGG 1250 

Ml MM II MM I M MMM MM III I III MM 

Db 775 TCTGCCTGCTTCATCCCACTCAGAAAACAGCCTTCATGCAGTTTTACAAGAATGCTAAGG 834 

Qy 1251 ACTGGTGGCTCTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACA 1310 

I II I MIMIMIM MM MM MMM I II II I 

Db 835 ACTGGTGGCTATTTAGTTTCTATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATA 894 

Qy 1311 CCCTCATGACTTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTG 1370

II I Mill Mill MMMI I I II Mill II MMM I I II 

Db 895 CCTTGATGACCTGTGAAATGTTGAGAAAGAAGAGTGGCA TGCAAATTGCTTTAAATG 951 

Qy 1371 AACATCTTAAGCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTG 1430

I II I MMM I I 1 1 1 M 1 1 1 MMM MMM MM I I MM 

Db 952 ATCACTTAAAGCAGAGAAGGGAAGTGGCGAAAACAGTATTCTGCCTGGTCCTTGTCTTTG 1011 

Qy 1431 CTCTTTGCTGGTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAA 1490

I II MMM I II I III I 1 1 Mill I Ml I III I II 

Db 1012 CCCTGTGCTGGCTTCCTCTTCACCTCAGCAGGATTTTGAAACACACTCTTTATGATCAGA 1071 

Qy 1491 TGGACAAGAACCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTA 1550

II I MMMM I I II II II II I MM Mill M II I 

Db 1072 ATGATCCCCATAGATGTGAACTTTTGAGCTTTTTGTTGGTATTGGACTACATTGGCATCA 1131 

Qy 1551 ACTTGGCAACCATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAAT 1610 

M MM II MMMI II II II II MMMIMIMM MMMM I II 

Db 1132 ACATGGCCTCCCTGAATTCCTGCATTAATCCAATAGCTCTGTATTTGGTGAGCAAAAGAT 1191 

Qy 1611 TTAAAAATTGTTTCCAGTCATGCCTCTGCTGCTGCTG 1647

I Mill II II MM III I MMMM II 

Db 1192 TCAAAAACTGCTTTAAGTCGTGCTTATGCTGCTGGTG 1228 



RESULT 4 

US-09-016-434-1203 

; Sequence 1203, Application US/09016434 
; Patent No. 6500938 
; GENERAL INFORMATION: 

APPLICANT: Janice Au-Young

APPLICANT: Jeffrey J. Seilhamer 

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF SIGNALING 
TITLE OF INVENTION: PATHWAY GENE EXPRESSION 
NUMBER OF SEQUENCES: 1490
CORRESPONDENCE ADDRESS: 



ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 
STREET: 3174 PORTER DRIVE 
CITY: PALO ALTO 
STATE: CALIFORNIA 
COUNTRY: USA
ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/016,434

FILING DATE: HEREWITH 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: Zeller, Karen J. 

REGISTRATION NUMBER: 37,071 

REFERENCE/DOCKET NUMBER: PA-0002 US
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 855-0555 

TELEFAX: (650) 845-4166 
INFORMATION FOR SEQ ID NO: 1203: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1470 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: GENBANK 

CLONE: g182275
US-09-016-434-1203 

Query Match 9.6%; Score 395.8; DB 4; Length 1470; 

Best Local Similarity 65.7%; Pred. No. 9.7e-93; 

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3;
Qy 689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748

Db 314

Qy 749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808

Db 374 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 433

Qy 809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868

Db 434 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 493

Qy 869

Db 494



Qy 929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988

Mill I I II MINI I II II I Mill MM 

Db 550 ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 598

Qy 989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048

Mill Mill II II I II II Mill III I IN II MM Mill 

Db 599 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 658 

Qy 1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108

II llllll I II I III MM II II I III. I 

Db 659 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 718 

Qy 1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168 

I! Ill I I II I I Mill I II M M II II M II I II I I 

Db 719 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 778 
Qy 1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC ACATCAAAA 1222 

Ml II I M I I 1 M III MM II II I II 

Db 779 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 838 

Qy 1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282 

llllll MM III I I Ml II IN M Ml I MIMIIMM II 

Db 839 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 898

Qy 1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342 

II MM MIMI I II M II II Mill Mill llllll MM I 

Db 899 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG AGAAAG 955 

Qy 1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402

II I II II Mill I I III II II MMM I II IIIIIMI III 

Db 956 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1015

Qy 1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462

II M II Ml MM I I Mill II llllll I II llllll I III I 

Db 1016 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1075 

Qy 1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522

M Mill III I Mill I I II II IIIIIMI I I II II 

Db 1076 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1135 

Qy 1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582 

I II I MM II II Mill III MM I I! M II Mill 

Db 1136 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1195

Qy 1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642 

II MIMIIMM II I III Mill II II 'II I llllll 

Db 1196 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1255 

Qy 1643 TGCTG 1647 

II II 

Db 1256 TGGTG 1260 



RESULT 5 

US-08-910-864-13 

; Sequence 13, Application US/08910864 
; Patent No. 6280931 



; GENERAL INFORMATION: 

APPLICANT: SAKAMOTO, AIJI 
APPLICANT: HANAOKA, FUMIO 

TITLE OF INVENTION: METHOD FOR SPECIFICALLY AMPLIFYING A cDNA OF AN EXTREMELY

TITLE OF INVENTION: SMALL QUANTITY 
NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: FISH & RICHARDSON P.C. 

STREET: 4225 EXECUTIVE SQUARE, SUITE 1400

CITY: LA JOLLA 

STATE: CA

COUNTRY: USA

ZIP: 92037 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/910,864

FILING DATE: 13-AUG-1997 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 216506/1996 

FILING DATE: 16-AUG-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: HAILE, LISA A. 

REGISTRATION NUMBER: 38,347

REFERENCE/DOCKET NUMBER: 07898/017001 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 619-678-5070 

TELEFAX: 619-678-5099 
INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1873 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: double

TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
FEATURE:

NAME/KEY: CDS

LOCATION: 231..1556
US-08-910-864-13 

Query Match 9.6%; Score 395.8; DB 3; Length 1873; 

Best Local Similarity 65.7%; Pred. No. 1.1e-92;

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3; 
Qy 689 TGCCCA-CAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748

1 1 1 1 I II I II I IIIMIIIIIMI Mill II I II II 

Db 498 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTC 557

Qy 749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808

I I III I 1 1 1 1 II I Mill I II II II M II II Ml 1 1 1 1 1 

Db 558 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 617 



Qy 809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868

II II III I II II Mill MM Mill I II II ' 1 lit I 

Db 618 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 677 

Qy 869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928 

I I 1 1 1 1 1 1 1 1 ! Ml II I lllllllllll I I MUM 

Db 678 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC 733 

Qy 929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988

Mill I I II llllll I II II I Mill 1 1 1 1 

Db 734 ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 782 

Qy 989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048

Mill Mill II II I II II Mill III IMIMI II MM Mill 

Db 783 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 842 

Qy 1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108

II llllll I II I MM MM II II I llllllll I 

Db 843 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 902 

Qy 1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168 

II III I I II I I Mill I llllllll II II II I II I I 

Db 903 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 962 
Qy 1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC ACATCAAAA 1222 

I II II I II II II Ml MM II M I II 

Db 963 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1022 

Qy 1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282

llllll MM III I I III II llllllll III I lllllllllll II 

Db 1023 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTATTCAGTTTCTATTTCTGCTTG 1082

Qy 1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342 

M MM llllll I II II II II Mill Mill llllll MM I 

Db 1083 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG AGAAAG 1139 

Qy 1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402

M I II M Mill I I III II II llllll I II llllllll Ml 

Db 1140 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1199

Qy 1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462

II II II III MM I I Mill M llllll I II llllll I III I 

Db 1200 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1259

Qy 1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522

M Mill III I Mill MM II llllllll I I II II 

Db 1260 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1319

Qy 1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582 

I M I MM M M Mill III MM I IMIMI II M Mill 

Db 1320 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1379 

Qy 1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642 

II Mill llllllll I Ml Mill II II llllllll I llllll 

Db 1380 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1439

Qy 1643 TGCTG 1647 



Db 1440 TGGTG 1444 



RESULT 6 
US-08-121-446-3 

; Sequence 3, Application US/08121446 

; Patent No. 6313276 

; GENERAL INFORMATION: 

APPLICANT: IMURA, HIROO 

APPLICANT: NAKAO, KAZUWA 

APPLICANT: NAKANISHI, SHIGETADA

TITLE OF INVENTION: A HUMAN ENDOTHELIN RECEPTOR 
NUMBER OF SEQUENCES: 4 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: MORRISON & FOERSTER 

STREET: 755 Page Mill Road 

CITY: Palo Alto 

STATE: California 

COUNTRY: USA

ZIP: 94304-1018 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/121,446

FILING DATE: 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/911,684 

FILING DATE: 10-JUL-1992
ATTORNEY/AGENT INFORMATION: 

NAME: CIOTTI, THOMAS E. 

REGISTRATION NUMBER: 21,013 

REFERENCE/DOCKET NUMBER: 29900-20324.00 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 813-5600 

TELEFAX: (415) 494-0792 

TELEX: 706141 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 4301 base pairs

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear 
FEATURE:

NAME/KEY: CDS

LOCATION: 238..1566
US-08-121-446-3 



Query Match 9.6%; Score 395.8; DB 4; Length 4301;

Best Local Similarity 65.7%; Pred. No. 1.7e-92;

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3;



Qy 689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748



Db 505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564

Qy 749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808

I I Ml I 1 1 1 1 II I Mill I II II II II II II III Mill 

Db 565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624 

Qy 809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868

II II III I II II Mill MM Mill I II II llllllll I 

Db 625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684 

Qy 869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928

I I llllllll Illllllllllll I ! 1 1 1 1 1 i 1 1 1 1 I I MINI 

Db 685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC 740

Qy 929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988

Mill I I II llllll I II II I Mill MM 

Db 741 ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789

Qy 989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048

1 1 1 1 1 Mill II II I II II Mill III lllllll II MM Mill 

Db 790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849

Qy 1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108

II llllll I II I IIIIIIMI MM II II I MMMM I 

Db 850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909

Qy 1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168 

II III I I II I I Mill I IIIIIIM II M II I II I I 

Db 910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969 
Qy 1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC ACATCAAAA 1222 

llllllll II II III MM II II I II 

Db 970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029

Qy 1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282

llllll MM III I I III II llllllll III I lllllllllll II 

Db 1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089

Qy 1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342 

II MM llllll I M II II II Mill Mill llllll MM I 

Db 1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG AGAAAG 1146

Qy 1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402

II I II II Mill I I III II II llllll I II llllllll Ml 

Db 1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206

Qy 1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462

II II II III MM I I Mill II llllll I M llllll I III I 

Db 1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266

Qy 1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522

II Mill III I Mill I I II II llllllll I I II II 

Db 1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326

Qy 1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582 

I II I MM II II Mill III MM I lllllll M II Mill 



Db 1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386 



Qy 1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642

Db 1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446

Qy 1643 TGCTG 1647

Db




RESULT 7 

PCT-US92-02091-1 

; Sequence 1, Application PC/TUS9202091 
; GENERAL INFORMATION: 

APPLICANT: Battey Jr., James F. 

APPLICANT: Corjay, Martha H.

APPLICANT: Feldman, Richard I. 

APPLICANT: Harkins, Richard N. 

TITLE OF INVENTION: RECEPTORS FOR BOMBESIN-LIKE PEPTIDES
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Edwin P. Ching 

STREET: 1501 Harbor Bay Parkway

CITY: Alameda 

STATE: CA

COUNTRY: USA

ZIP: 94501 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02091 

FILING DATE: 19920313 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/426,150 

FILING DATE: 24-OCT-1989
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/533,659 

FILING DATE: 05-JUN-1990 
ATTORNEY/AGENT INFORMATION: 

NAME: Ching, Edwin P. 

REGISTRATION NUMBER: 34090

REFERENCE/DOCKET NUMBER: A-0092C 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-266-7476 

TELEFAX: 415-266-7400 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1700 base pairs 

TYPE: NUCLEIC ACID 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 



HYPOTHETICAL: NO 

ORIGINAL SOURCE: 

ORGANISM: Mus musculus 
CELL LINE: Swiss 3T3 

IMMEDIATE SOURCE: 

LIBRARY: Lambda GT10 

FEATURE:

NAME/KEY: CDS
LOCATION: 378..1532
PCT-US92-02091-1 



Query Match 2.5%; Score 101.8; DB 5; Length 1700; 

Best Local Similarity 49.3%; Pred. No. 2.1e-16; 

Matches 308; Conservative 0; Mismatches 302; Indels 15; Gaps 1; 



Qy 714 CAGCTTTCAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGG 773

1 1 MM M 1 1 1 Ml 1 1 1 1 III II 1 1 II 1 1 1

Db 490 CGGGCTTCATCTATGTCATCCCTGCAGTTTATGGGCTTATCATCGTGATAGGTCTTATTG 549

Qy 774 GGAATGCAACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACG 833

1 II II II III MM 1 II 1 III 1 III 1 II 1 II III

Db 550 GCAACATC^CGCTCATC^GATCTTCTGCACGGTCAAGTCCATGCGAAACGTGCCAAACC 609

Qy 834 CGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCA 893

1 1 II 1 II II II 1 IMIIMI 1 MM 1 MM 1

Db 610 TGTTCATCTCTAGCCTGGCTTTGGGAGACCTGCTGCTGCTGGTGACATGCGCCCCTGTGG 669

Qy 894 ATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTC 953

III III lllllll 1 MM llllll 1 II

Db 670 ATGCCAGCAAGTACCTGGCTGACAGGTGGCTA TTTGGCAGAATTG 714

Qy 954 TTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCT 1013

Mill III Illlllll 1 II 1 II llllll 1 1 III III II

Db 715 GCTGCAAACTGATCCCCTTTATACAACTTACTTCAGTGGGGGTGTCTGTCTTCACACTTA 774

Qy 1014 GCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTG 1073

II M 1 Mi, III III 1 1 1 MM 1

Db 775 CGGCA.CTGTC^GCTGA(^GGTACAAAGCCATTGTACGGC(^TGGATATCCAGGCATCCC 834

Qy 1074 GGATTCCTTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGG 1133

1 II 1 llllll IMIIMI Mill III

Db 835 ATGCCCTGATGAAGATCTGTCTCAAAGCTGCTTTGATCTGGATTGTCTCTATGTTGTTGG 894

Qy 1134 CCATTCCTGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATA 1193

MM II II II 1 1 1 1 1 Mill 1 III MM

Db 895 CCATCCCAGAGGCTGTGTTTTCTGACCTCCACCCCTTCCATGTGAAAGATACCAACCAAA 954

Qy 1194 AAACCTGTATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACT 1253

1 II II II 1 1 II III 1 III III 1

Db 955 CCTTCATTAGTTGTGCCCCCTACCCACACTCCAATGAGCTACACCCTAAAATCCATTCCA 1014

Qy 1254 GGTGGCTCTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCC 1313

1 III IMIIMI M III 1 II 1 III MM 1

Db 1015 TGGCTTCCTTTCTGGTTTTCTACGTTATCCCACTGGCGATCATCTCTGTCTACTACTACT 1074

Qy 1314 TCATGACTTGTGAGATGTTGAACAG 1338

llllll

Db 1075 TCATTGCCCGAAATCTGATTCAGAG 1099



RESULT 8 

US-09-016-434-1215 

; Sequence 1215, Application US/09016434 
; Patent No. 6500938 
; GENERAL INFORMATION: 

APPLICANT: Janice Au-Young

APPLICANT: Jeffrey J. Seilhamer

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF SIGNALING 
TITLE OF INVENTION: PATHWAY GENE EXPRESSION 
NUMBER OF SEQUENCES: 1490 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

STREET: 3174 PORTER DRIVE 

CITY: PALO ALTO 

STATE: CALIFORNIA

COUNTRY: USA

ZIP: 94304
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/016,434

FILING DATE: HEREWITH 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: Zeller, Karen J. 

REGISTRATION NUMBER: 37,071 

REFERENCE/DOCKET NUMBER: PA-0002 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 855-0555 

TELEFAX: (650) 845-4166 
INFORMATION FOR SEQ ID NO: 1215: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1726 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: GENBANK 

CLONE: g183649
US-09-016-434-1215 

Query Match 2.2%; Score 91; DB 4; Length 1726; 

Best Local Similarity 48.3%; Pred. No. 1.3e-13; 

Matches 299; Conservative 0; Mismatches 305; Indels 15; Gaps 1; 

Qy 720 TCAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATG 779

M II I I I III I I I I III I I I II I I II II 



Db 514 TCCTCTATGTCATCCCTGCAGTTTATGGGGTTATCATTCTGATAGGCCTCATTGGCAACA 573



Qy 780 CAACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGA 839

III II III I I I I II III I III I II I II III III 

Db 574 TCACTTTGATCAAGATCTTCTGTACAGTCAAGTCCATGCGAAACGTTCCAAACCTGTTCA 633

Qy 840 TAGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTAT 899

I 1 1 1 i 1 1 1 II I MINIM II III I II I III 

Db 634 TTTCCAGTCTGGCTTTGGGAGACCTGCTCCTCCTAATAACGTGTGCTCCAGTGGATGCCA 693 

Qy 900 TTAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCA 959

I I MMMI I MM MM I II I MM 

Db 694 GCAGGTACCTGGCTGACAGATGGCTATTTGGCAGGATTGGC TGCA 738

Qy 960 AGCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTC 1019 

I III 1 1 i 1 1 1 1 1 I III MM II III I I III III III II I 

Db 739 AACTGATCCCCTTTATACAGCTTACCTCTGTTGGGGTGTCTGTCTTCACACTCACGGCGC 798

Qy 1020 TTAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTC 1079

I I Mill MM III Ml I I II MM I 

Db 799 TCTCGGCAGACAGATACAAAGCCATTGTCCGGCCAATGGATATCCAGGCCTCCCATGCCC 858

Qy 1080 CTTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTC 1139

II I I I II I II MINIMI I III I I MM! M 

Db 859 TGATGAAGATCTGCCTCAAAGCCGCCTTTATCTGGATCATCTCCATGCTGCTGGCCATTC 918

Qy 1140 CTGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCT 1199

I II M I I I I I Mill II I I II I I 

Db 919 CAGAGGCCGTGTTTTCTGACCTCCATCCCTTCCATGAGGAAAGCACCAACCAGACCTTCA 978 

Qy 1200 GTATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGC 1259

II I III I I Ml I III I I I I 

Db 979 TTAGCTGTGCCCCATACCCACACTCTAATGAGCTTCACCCCAAAATCCATTCTATGGCTT 1038

Qy 1260 TCTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGA 1319

III I III I I II II II I II III MM I MM 

Db 1039 CCTTTCTGGTCTTCTACGTCATCCCACTGTCGATCATCTCTGTTTACTACTACTTCATTG 1098 

Qy 1320 CTTGTGAGATGTTGAACAG 1338 

II I II I I II 

Db 1099 CTAAAAATCTGATCCAGAG 1117 



RESULT 9 

PCT-US92-02091-3 

Sequence 3, Application PC/TUS9202091 
GENERAL INFORMATION: 

APPLICANT: Battey Jr., James F. 
APPLICANT: Corjay, Martha H.
APPLICANT: Feldman, Richard I.
APPLICANT: Harkins, Richard N.

TITLE OF INVENTION: RECEPTORS FOR BOMBESIN-LIKE PEPTIDES
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Edwin P. Ching 
STREET: 1501 Harbor Bay Parkway 
CITY: Alameda 



STATE: CA

COUNTRY: USA

ZIP: 94501 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02091

FILING DATE: 19920313 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/426,150 

FILING DATE: 24-OCT-1989 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/533,659 

FILING DATE: 05-JUN-1990 
ATTORNEY/AGENT INFORMATION: 

NAME: Ching, Edwin P. 

REGISTRATION NUMBER: 34090 

REFERENCE/DOCKET NUMBER: A-0092C 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-266-7476 

TELEFAX: 415-266-7400 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1726 base pairs 

TYPE: NUCLEIC ACID 

STRANDEDNESS: double

TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: NO 
ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 

CELL TYPE: Small cell lung carcinoma 

CELL LINE: NCI-H345 
IMMEDIATE SOURCE: 

LIBRARY: Lambda GT10 
FEATURE:

NAME/KEY: CDS

LOCATION: 399..1553
PCT-US92-02091-3 



Query Match 2.2%; Score 91; DB 5; Length 1726; 

Best Local Similarity 48.3%; Pred. No. 1.3e-13; 

Matches 299; Conservative 0; Mismatches 305; Indels 15; Gaps 1; 

Qy 720 TCAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATG 779

I I II I I I III I I I I III I I I II I I II II 
Db 514 TCCTCTATGTCATCCCTGCAGTTTATGGGGTTATCATTCTGATAGGCCTCATTGGCAACA 573 



Qy 780 CAACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGA 839 

III II III 1 1 1 1 II III I III I II I II III Ml 

Db 574 TCACTTTGATCAAGATCTTCTGTACAGTCAAGTCC 633 

Qy 840 TAGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTAT 899



I M II I IMIIIII II III I II I III 

Db 634 TTTCCAGTCTGGCTTTGGGAGACCTGCTCCTCCTAATAACGTGTGCTCCAGTGGATGCCA 693 

Qy 900 TTAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCA 959 

I I 1 I I I I I I I I I II I I I I I II I I I I I 

Db 694 GCAGGTACCTGGCTGACAGATGGCTATTTGGCAGGATTGGC TGCA 738 

Qy 960 AGCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTC 1019 

I III IMIIIII I III 1 1 1 1 II III I I III III III II I 

Db 739 AACTGATCCCCTTTATACAGCTTACCTCTGTTGGGGTGTCTGTCTTCACACTCACGGCGC 798

Qy 1020 TTAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTC 1079

I I Mill MM III III I I I I MM I 

Db 799 TCTCGGCAGACAGATACAAAGCCATTGTCCGGCCAATGGATATCCAGGCCTCCCATGCCC 858

Qy 1080 CTTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTC 1139

III I I II I II MINIMI I III I I llllllllll 

Db 859 TGATGAAGATCTGCCTCAAAGCCGCCTTTATCTGGATCATCTCCATGCTGCTGGCCATTC 918

Qy 1140 CTGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCT 1199

I II II I I I I I Mill II I I II I I 

Db 919 CAGAGGCCGTGTTTTCTGACCTCCATCCCTTCCATGAGGAAAGCACCAACCAGACCTTCA 978 

Qy 1200 GTATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGC 1259

II I II I I I III I III I I I I 

Db 979 TTAGCTGTGCCCCATACCCACACTCTAATGAGCTTCACCCCAAAATCCATTCTATGGCTT 1038 

Qy 1260 TCTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGA 1319

Ml I III I I II II II I II III MM I MM 

Db 1039 CCTTTCTGGTCTTCTACGTCATCCCACTGTCGATCATCTCTGTTTACTACTACTTCATTG 1098 

Qy 1320 CTTGTGAGATGTTGAACAG 1338

II I II h I II 

Db 1099 CTAAAAATCTGATCCAGAG 1117 



RESULT 10 
US-09-120-772-1 

; Sequence 1, Application US/09120772 
; Patent No. 6143521 
; GENERAL INFORMATION: 

APPLICANT: LANE, PAMELA 

APPLICANT: TSUI, PING

APPLICANT: ELSHOURBAGY, NABIL 

TITLE OF INVENTION: HUMAN BOMBESIN RECEPTOR SUBTYPE 
TITLE OF INVENTION: 3 
NUMBER OF SEQUENCES: 2 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Ratner & Prestia 

STREET: P.O. Box 980

CITY: Valley Forge 

STATE: PA

COUNTRY: USA

ZIP: 19482 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 



OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/120,772
FILING DATE: 22-JUL-1998 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/AGENT INFORMATION:
NAME: Prestia, Paul F 
REGISTRATION NUMBER: 23,031 
REFERENCE/DOCKET NUMBER: GP-70505 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 610-407-0700
TELEFAX: 610-407-0700
TELEX: 846169 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1205 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-09-120-772-1 

Query Match 1.9%; Score 79.6; DB 3; Length 1205; 

Best Local Similarity 46.9%; Pred. No. 9.9e-11;

Matches 419; Conservative 0; Mismatches 439; Indels 36; Gaps 4; 
Qy 761 GTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGG 820

Mill II I II Mill I III III 1 1 III I III Mill III 

Db 180 GTGGGCATCCTTGGAAATGCTATTCTCATCAAAGTCTTTTTCAAGACCAAATCCATGCAA 239

Qy 821 AATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATT 880 

I I II II III MM II II Mill; III I I II I I I 

Db 240 ACAGTTCCAAATATTTTCATCACCAGCCTGGCTTTTGGAGATCTTTTACTTCTGCTAACT 299

Qy 881 GATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGAC 940

I I II I III I I I II II I I MM II 

Db 300 TGTGTGCCAGTGGATGCAACTCACTACCTTGCAGAAGGATGGCTGTTC 347 

Qy 941 TTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACC 1000 

^ II I II III III II II I II I I I Ml II II I I 

Db 348 GGAAGAATTGGTTGTAAGGTGCTCTCTTTCATCCGGCTCACTTCTGTTGGTGTGTCA 404

Qy 1001 GTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGT 1060 

II Ml I III II I MM MM 1 1 [ 1 1 1 1 I 

Db 405 GTGTTCAC^TTAGCAATTCT(^GCGCTGACAGATAC^GGCAGTTGTGAAGCCACTTGAG 464

Qy 1061 GTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTG 1120 

Ml II III I M II II I MMMM II 

Db 465 CGACAGCCCTCCAATGCCATCCTGAAGACTTGTGTAAAAGCTGGCTGCGTCTGGATCGTG 524

Qy 1121 TCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGG 1180

M I II I II I Mill II II I I I III I II 

Db 525 TCTATGATATTTGCTCTACCTGAGGCTATATTTTCAAATGTATACACTTTTCGAGATCCC 584



Qy 1181 GGTGAACAGCATAAAACCTGTATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAA 1240

I II I II II I I I I II III III 

Db 585 AATAAAAATATGACATTTGAATCATGTACCTCTTATCCTGTCTCTAAGAAGCTCTTGCAA 644 

Qy 1241 GATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCG 1300 

II II I II 1 1 1 1 I I I I III II I I I 

Db 645 GAAATACATTCTCTGCTGTGCTTCTTAGTGTTCTACATTATTCCACTCTCTATTATCTCT 704

Qy 1301 ATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATT 1360 

III III II I II II I I I II II I ll I 

Db 705 GTCTACTATTCCTTGATTGCTAGGACCCTTTACAAAAGCACCCTGAACATACCTACTGAG 764 
Qy 1361 GCCCTCAGTGAACATCTTAAGCAG CGTCGAGAAGTGGCAAAAACAGTTTTC 1411 

I I II I I IIIIMI II I I II I III II II 

Db 765 GAACAAAGCCATGCCCGTAAGCAGATTGAATCCCGAAAGAGAATTGCCAGAACGGTATTG 824 

Qy 1412 TGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGTATATTGAAG 1471 

Mill I I Mill II Mill ! II 1 1 1 1 I I III I I 

Db 825 GTGTTGGTGGCTCTGTTTGCCCTCTGCTGGTTGCCAAATCACCT--CCTGTACCTCTACC 882

Qy 1472 AAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTCTTACTGCTC 1531 

II I I II III II II II II I II I 

Db 883 ATTCATTCACTTCTCAAACCTATGTAGACCCCTCTGCCATGCATTTCATTTTCACCATT- 941 

Qy 1532 ATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCCATAGCTCTG 1591

II I Mill II Mill II 1 1 1 1 1 1 1 1 I Mill 

Db 942 TTCTCTCGGGTTTTGGCTTTCAGCAATTCTTGCGTAAACCCCTTTGCTCTC 992 

Qy 1592 TATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGCTGC 1645 

II I IIIIMI I II I II Ml I I II III III 

Db 993 TACTGGCTGAGCAAAAGCTTCCAGAAGCATTTTAAAGCTCAGTTGTTCTGTTGC 1046 



RESULT 11 

US-09-016-434-1275 

; Sequence 1275, Application US/09016434 

; Patent No. 6500938 

; GENERAL INFORMATION: 

APPLICANT: Janice Au-Young

APPLICANT: Jeffrey J. Seilhamer 

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF SIGNALING 
TITLE OF INVENTION: PATHWAY GENE EXPRESSION 
NUMBER OF SEQUENCES: 1490
CORRESPONDENCE ADDRESS: 

ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

STREET: 3174 PORTER DRIVE 

CITY: PALO ALTO 

STATE: CALIFORNIA

COUNTRY: USA

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2
CURRENT APPLICATION DATA: 



APPLICATION NUMBER: US/09/016,434 

FILING DATE: HEREWITH 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: Zeller, Karen J. 

REGISTRATION NUMBER: 37,071 

REFERENCE/DOCKET NUMBER: PA-0002 US
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 855-0555 

TELEFAX: (650) 845-4166 
INFORMATION FOR SEQ ID NO: 1275: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1413 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: GENBANK 

CLONE: g291876 
US-09-016-434-1275 

Query Match 1.9%; Score 79.6; DB 4; Length 1413; 

Best Local Similarity 46.9%; Pred. No. 1.1e-10;

Matches 419; Conservative 0; Mismatches 439; Indels 36; Gaps 4; 
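
The percentages in the summary lines above appear to follow directly from the reported counts. The sketch below is an inference from the figures printed in this listing, not a documented GenCore formula: it assumes "Query Match" is the hit score divided by the perfect score (4105 for this query), and "Best Local Similarity" is matches divided by the sum of matches, mismatches, and indels. The function names are hypothetical.

```python
def query_match_pct(score, perfect_score):
    """Hit score as a percentage of the query's perfect (self-match) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches, mismatches, indels):
    """Matched positions as a percentage of all aligned columns (assumed definition)."""
    return 100.0 * matches / (matches + mismatches + indels)


# Figures reported for US-09-016-434-1275; perfect score taken from the search header.
print(round(query_match_pct(79.6, 4105), 1))
print(round(best_local_similarity_pct(419, 439, 36), 1))
```

Under these assumptions the two values round to the 1.9% and 46.9% printed for this hit.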

Qy 761 GTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGG 820

Mill II I II Mill I III III II III I III Mill III 

Db 327 GTGGGCATCCTTGGAAATGCTATTCTCATCAAAGTCTTTTTCAAGACCAAATCCATGCAA 386 

Qy 821 AATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATT 880

Db 387 ACAGTTCCAAATATTTTCATCACCAGCCTGGCTTTTGGAGATCTTTTACTTCTGCTAACT 446

Qy 881 GATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGAC 940

I I II I Ml I I I II II I I MM II

Db 447 TGTGTGCCAGTGGATGCAACTCACTACCTTGCAGAAGGATGGCTGTTC 494

Qy 941 TTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACC 1000

II I II III III II Ml II I I I I II MM I I

Db 495 GGAAGAATTGGTTGTAAGGTGCTCTCTTTCATCCGGCTCACTTCTGTTGGTGTGTCA 551

Qy 1001 GTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGT 1060

II Ml I Ml II I II II II MM lllllll I

Db 552 GTGTTC^CA.TTAACAATTCTCAGCGCTGACAGATACAAGGCAGTTGTGAAGCCACTTGAG 611

Qy 1061 GTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTG 1120

Db

Qy 1121 TCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGG 1180

M I M I II I Mill II II I I lllllll

Db 672 TCTATGATATTTGCTCTACCTGAGGCTATATTTTCAAATGTATACACTTTTCGAGATCCC 731

Qy 1181 GGTGAACAGCATAAAACCTGTATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAA 1240

I II I II II I I I I II III II! 

Db 732 AATAAAAATATGACATTTGAATCATGTACCTCTTATCCTGTCTCTAAGAAGCTCTTGCAA 791 

Qy 1241 GATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCG 1300 

II II I II 1 1 1 1 I I I I III II I II 

Db 792 GAAATACATTCTCTGCTGTGCTTCTTAGTGTTCTACATTATTCCACTCTCTATTATCTCT 851 

Qy 1301 ATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATT 1360

III III II I II II I I I II II I II I 

Db 852 GTCTACTATTCCTTGATTGCTAGGACCCTTTACAAAAGCACCCTGAACATACCTACTGAG 911 

Qy 1361 GCCCTCAGTGAACATCTTAAGCAG---------CGTCGAGAAGTGGCAAAAACAGTTTTC 1411

I I II I I MM || | | || I III II II 

Db 912 GAACAAAGCCATGCCCGTAAGCAGATTGAATCCCGAAAGAGAATTGCCAGAACGGTATTG 971

Qy 1412 TGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGTATATTGAAG 1471 

Mill I I Mill II llllllll II MM I I III I I 

Db 972 GTGTTGGTGGCTCTGTTTGCCCTCTGCTGGTTGCCAAATCACCT--CCTGTACCTCTACC 1029

Qy 1472 AAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTCTTACTGCTC 1531

II I I II III II II II II I II I 

Db 1030 ATTCATTCACTTCTCAAACCTATGTAGACCCCTCTGCCATGCATTTCATTTTCACCATT- 1088

Qy 1532 ATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCCATAGCTCTG 1591 

II I Mill II Mill II llllllll I Mill 

Db 1089 TTCTCTCGGGTTTTGGCTTTCAGCAATTCTTGCGTAAACCCCTTTGCTCTC 1139 

Qy 1592 TATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGCTGC 1645 

II I Mill I II I || Ml I I || IN 1 1 1 

Db 1140 TACTGGCTGAGCAAAAGCTTCCAGAAGCATTTTAAAGCTCAGTTGTTCTGTTGC 1193



RESULT 12 

US-09-016-434-1233 

; Sequence 1233, Application US/09016434 

; Patent No. 6500938 

; GENERAL INFORMATION: 

APPLICANT: Janice Au-Young

APPLICANT: Jeffrey J. Seilhamer 

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF SIGNALING 
TITLE OF INVENTION: PATHWAY GENE EXPRESSION 
NUMBER OF SEQUENCES: 1490 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

STREET: 3174 PORTER DRIVE 

CITY: PALO ALTO 

STATE: CALIFORNIA 

COUNTRY: USA

ZIP: 94304
COMPUTER READABLE FORM:
MEDIUM TYPE: Floppy disk

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/016,434



FILING DATE: HEREWITH 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: Zeller, Karen J. 

REGISTRATION NUMBER: 37,071 

REFERENCE/DOCKET NUMBER: PA-0002 US
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 855-0555 

TELEFAX: (650) 845-4166 
INFORMATION FOR SEQ ID NO: 1233: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1352 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: GENBANK

CLONE: g189241
US-09-016-434-1233 

Query Match 1.7%; Score 68.6; DB 4; Length 1352; 

Best Local Similarity 50.1%; Pred. No. 7.6e-08; 

Matches 210; Conservative 0; Mismatches 194; Indels 15; Gaps 1; 
Qy 752 ATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAACAAA 811 

II Ml MINI II 1 1 1 1 II I III I I 1 1 1 1 I I 1 1 1 1 

Db 296 ATCATC^CCGTGGGCTTGCTGGGC^CATCATGCTGGTGAAGATCTTCATCACC^CAGC 355

Qy 812 TGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATCTAT 871

Db 356 GCCATGAGGAGCGTCCCCAACATCTTCATCTCTAACCTGGCGGCCGGGGACTTGCTGCTG 415

Qy 872 GTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTTGAT 931

II II

Db TTCGAC 460

Qy 932 CACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCGGTG 991

Db

Qy 992 GGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCCTCC 1051

Db

Qy 1052 TGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCCATC 1111

I I III III II II II I III

Db 581 CCCATGGACATGCAGACGTCAGGGGCATTGCTGCGGACCTGTGTGAAGGCCATGGGTATC 640

Qy 1112 TGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCCTT 1170

Db



RESULT 13 
PCT-US92-02091-7 

Sequence 7, Application PC/TUS9202091 
GENERAL INFORMATION: 

APPLICANT: Battey Jr., James F. 
APPLICANT: Corjay, Martha H.
APPLICANT: Feldman, Richard I. 
APPLICANT: Harkins, Richard N. 

TITLE OF INVENTION: RECEPTORS FOR BOMBESIN-LIKE PEPTIDES
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Edwin P. Ching 
STREET: 1501 Harbor Bay Parkway 
CITY: Alameda 
STATE: CA
COUNTRY: USA
ZIP: 94501
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02091
FILING DATE: 19920313 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/426,150 
FILING DATE: 24-OCT-1989 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/533,659 
FILING DATE: 05-JUN-1990 
ATTORNEY/AGENT INFORMATION: 
NAME: Ching, Edwin P. 
REGISTRATION NUMBER: 34 090 
REFERENCE/DOCKET NUMBER: A-0092C 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-266-7476 
TELEFAX: 415-266-7400
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1352 base pairs 
TYPE: NUCLEIC ACID 
STRANDEDNESS: double
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: NO 
ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 
CELL TYPE: Small cell lung carcinoma 
CELL LINE: NCI-H345 
FEATURE:

NAME/KEY: CDS
LOCATION: 140..1312
PCT-US92-02091-7 



Query Match 1.7%; Score 68.6; DB 5; Length 1352;



Best Local Similarity 50.1%; Pred. No. 7.6e-08; 

Matches 210; Conservative 0; Mismatches 194; Indels 15; Gaps 1; 



Qy 752 ATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAACAAA 811

II III MINI II 1 1 1 1 II I Ml I I MM I I MM 

Db 296 ATCATCACCGTGGGCTTGCTGGGCAACATCATGCTGGTGAAGATCTTCATCACCAACAGC 355

Qy 812 TGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATCTAT 871 

1 1 1 1 1 II I M II I II I II I I II II II Ml I I 

Db 356 GCCATGAGGAGCGTCCCCAACATCTTCATCTCTAACCTGGCGGCCGGGGACTTGCTGCTG 415 

Qy 872 GTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTTGAT 931 

II Ml MM I II II Mill || II 

Db 416 CTGCTCACCTGCGTCCC---------------GGTGGACGCCTCGCGCTACTTCTTCGAC 460

Qy 932 CACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCGGTG 991 

I MUM I Mill III MM I * I III Ml III 

Db 461 GAGTGGATGTTTGGCAAGGTGGGCTGCAAACTGATCCCTGTCATCCAGCTCACTTCCGTG 520

Qy 992 GGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCCTCC 1051 

Ml I MM Ml III II II II I llllllllllllll I I I 

Db 521 GGGGTTTCCGTGTTCACTCTCACTGCCCTCAGCGCCGACAGGTACAGAGCCATCGTTAAC 580 

Qy 1052 TGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCCATC 1111 

I I III III II II II I III 

Db 581 CCCATGGACATGCAGACGTCAGGGGCATTGCTGCGGACCTGTGTGAAGGCCATGGGTATC 640

Qy 1112 TGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCCTT 1170 

III I I III I I Mill MM MUM I I I III I II 

Db 641 TGGGTGGTCTCCGTGTTGCTGGCAGTTCCCGAAGCGGTGTTTTCAGAAGTGGCTCGCAT 699 



RESULT 14 
PCT-US92-02091-5 

; Sequence 5, Application PC/TUS9202091 
; GENERAL INFORMATION: 

APPLICANT: Battey Jr., James F. 

APPLICANT: Corjay, Martha H.

APPLICANT: Feldman, Richard I. 

APPLICANT: Harkins, Richard N. 

TITLE OF INVENTION: RECEPTORS FOR BOMBESIN-LIKE PEPTIDES
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Edwin P. Ching 

STREET: 1501 Harbor Bay Parkway 

CITY: Alameda 

STATE: CA

COUNTRY: USA

ZIP: 94501 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02091 

FILING DATE: 19920313 



CLASSIFICATION: 435
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/426,150 

FILING DATE: 24-OCT-1989 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/533,659 

FILING DATE: 05-JUN-1990 
ATTORNEY/AGENT INFORMATION: 

NAME: Ching, Edwin P. 

REGISTRATION NUMBER: 34 090 

REFERENCE/DOCKET NUMBER: A-0092C 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-266-7476 

TELEFAX: 415-266-7400 
; INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1584 base pairs 

TYPE: NUCLEIC ACID 

STRANDEDNESS: double

TOPOLOGY: linear
MOLECULE TYPE: cDNA to mRNA
HYPOTHETICAL: NO
ORIGINAL SOURCE: 

ORGANISM: Rattus rattus 

TISSUE TYPE: Esophagus 
FEATURE:

NAME/KEY: CDS

LOCATION: 132..1304
PCT-US92-02091-5 

Query Match 1.6%; Score 65.8; DB 5; Length 1584; 

Best Local Similarity 49.6%; Pred. No. 4.4e-07; 

Matches 209; Conservative 0; Mismatches 197; Indels 15; Gaps 1; 



Qy 752 ATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAACAAA 811

II M Mill M I I I I II I III I I III I I | | | | 

Db 288 ATCATCTCGGTGGGCTTGCTGGGC^CATCATGCTGGTGAAGATATTCCTCACCAACAGC 347

Qy 812 TGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATCTAT 871 

IN Ml M Mil III II II II MINIM I 

Db 348 ACCATGCGGAGTGTCCCCAACATCTTCATCTCTAACCTGGCTGCGGGAGACCTGCTGCTG 407

Qy 872 GTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTTGAT 931 

M I I I I I I I Ml I II I IIIMI 

Db 408 CTGCTGACCTGCGTCCCAGTGGATGCCTCC---------------CGATACTTCTTTGAT 452

Qy 932 CACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCGGTG 991 

I I M Ml I Mill II MM I Ml 1 1 1 1 1 1 1 1 

Db 453 GAATGGGTGTTCGGCAAGCTGGGCTGCAAACTCATCCCAGCCATCCAGCTCACCTCGGTG 512 

Qy 992 GGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCCTCC 1051 

Ml I MM III III II II II I MIMIIIIIIIIII I I I 

Db 513 GGGGTTTCCGTGTTCACTCTCACGGCCCTCAGCGCTGACAGGTACAGAGCTATCGTGAAC 572 

Qy 1052 TGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCCATC 1111 

I I Ml III I II II II II MM 

Db 573 CCCATGGACATGCAGACGTCTGGTGTGGTGCTGTGGACCAGTTTGAAGGCCGTGGGCATC 632 



Qy 1112 TGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCCTTT 1171

Db 633

Qy 1172 G 1172

Db 693 G 693



RESULT 15 
US-08-279-590A-1 

; Sequence 1, Application US/08279590A 

; Patent No. 5656749 

; GENERAL INFORMATION: 

APPLICANT: ELIOT R. SPINDEL # SRINIVASA 

APPLICANT: NAGALLA AND BRENDA BARRY 

TITLE OF INVENTION: NUCLEIC ACIDS ENCODING 

TITLE OF INVENTION: RECEPTORS FOR BOMBESIN-LIKE

TITLE OF INVENTION: PEPTIDES 

NUMBER OF SEQUENCES: 10 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson 

STREET: 225 Franklin Street

CITY: Boston 

STATE: Massachusetts 

COUNTRY: U.S.A. 

ZIP: 02110-2804 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5" Diskette, 1.44 Mb 

COMPUTER: IBM PS/2 Model 50Z or 55SX 

OPERATING SYSTEM: MS-DOS (Version 5.0) 

SOFTWARE: WordPerfect (Version 5.1) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/279,590A

FILING DATE: July 22, 1994 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: No. 5656749e 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
; NAME: Y. Rocky Tsao 

REGISTRATION NUMBER: 34,053 

REFERENCE/DOCKET NUMBER: 00537/098001 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (617) 542-5070 

TELEFAX: (617) 542-8906 

TELEX: 200154
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1563 

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear 
US-08-279-590A-1 



Query Match 1.5%; Score 63.6; DB 1; Length 1563;



Best Local Similarity 45.5%; Pred. No. 1.6e-06; 

Matches 332; Conservative 0; Mismatches 374; Indels 24; Gaps 2; 



Qy 735 CTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGA 794 

I I I I I II I I I I I I I II II II I II II III M I I 

Db 340 CTGTCACATATGCTGTTATTATTTCTGTCGGTATCCTTGGAAACACAATACTTATAAAAG 399

Qy 795 TCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCC 854

llll II III I III I I II II II I II I I I I II II 

Db 400 TATTTTTTAAAATCAAGTCAATGCAGACTGTTCCTAATATTTTCATCACCAGCCTGGCTT 459

Qy 855 TTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTG 914 

Illllll llll llll I II I I I Ml I III 

Db 460 TTGGAGATCTTCTTCTACTGCTGACCTGCGTGCCAGTGGACGCATCTCGGTATATTGTGG 519 

Qy 915 GGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTT 974 

III llll II II I III I I llll 

Db 520 ACACGTGGATGTTTGGAAGAGCTGGCTGT---------------AAGATAATTTCCTTCA 564

Qy 975 TGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGT 1034 

I III llll II II I I II I I I III III llllllll 

Db 565 TACAGCTTACCTCTGTCGGAGTGTCGGTGTTTACTTTAACTGTCCTCAGTACTGACAGGT 624 

Qy 1035 ACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCA 1094 

IMIIII III I I III I llll II 

Db 625 ACAGAGCCATTGTGAAACCCTTGCAATTGCAGACCTCAGATGCCGTTTTGAAGACATGTG 684 

Qy 1095 TTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCT 1154 

II III I I HIM I Ml I MM II III Mill I M 

Db 685 GCAAAGCTGTTTGTGTTTGGATCATTTCCATGCTCCTCGCTGCTCCAGAAGCTGTGTTCT 744 

Qy 1155 TCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCCA 1214 

I II llll II II I I I I 

Db 745 CAGATTTGTATGAATTTGGCAGCTCGGAAAAAAATACCACTTTTGAAGCCTGTGCTCCAT 804 

Qy 1215 CATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATT 1274 

II I III Mill I I I llll llll 

Db 805 ATCCAGTCTCTGAAAAGATTCTGCAAGAGACACATTCCCTAATATGCTTCCTGGTATTCT 864 

Qy 1275 TCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGA 1334 

I I I III II II II I III III II I III I 

Db 865 ACA.TTGTTCCCCTGTC^TCATTTCTGCATATTACTTCCTTATTGCAAAAACCCTGTACA 924 

Qy 1335 A---------CAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGC 1385

I II I I II I I I I II MM I I II 

Db 925 AAAGTACTTTCAACATGCCTGCTGAAGAGCACACTCACGCCCGAAAACAGATAGAATCGC 984 

Qy 1386 GTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCC 1445

I I llllllllllll II II Mill Illllll I llllllll I 

Db 985 GCAAACGAGTGGCAAAAACTGTGTTGGTGTTGGTGGCATTGTTCGCAGTGTGCTGGTTGC 1044 

Qy 1446 CTCTTCACTT 1455

I III I 

Db 1045 CAAACCACAT 1054 



Search completed: December 13, 2003, 00:00:12 



Job time : 254.613 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd.



OM nucleic - nucleic search, using sw model

Run on: December 12, 2003, 10:54:32 ; Search time 1202.3 Seconds
(without alignments)
11347.756 Million cell updates/sec

Title: US-09-931-157-1

Perfect score: 4105

Sequence: 1 gaattcgcggccgcctcttg taaaagtttacagaaacctt 4105

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 2201672 seqs, 1661799599 residues

Total number of hits satisfying chosen parameters: 4403344

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : Published_Applications_NA: *

1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq: *
2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq: *
3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq: *
4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq: *
5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq: *
6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq: *
7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq: *
8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq: *
9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq: *
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq: *
11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq: *
12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq: *
13: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq2: *
14: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq: *
15: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq: *
16: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq: *
17: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq: *
18: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq: *

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

SUMMARIES

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ALIGNMENTS 



RESULT 1 
US-09-931-157-1 

; Sequence 1, Application US/09931157 

; Patent No. US20020082414A1 

; GENERAL INFORMATION: 

; APPLICANT: Imura, Hiroo 



APPLICANT: Nakao, Kazuwa 

APPLICANT: Nakanishi, Shigetada 

TITLE OF INVENTION: Human Endothelin Receptor 

FILE REFERENCE: 299002032411 

CURRENT APPLICATION NUMBER : US/09/931,157 

CURRENT FILING DATE: 2001-10-15 

PRIOR APPLICATION NUMBER: 08/121,446 

PRIOR FILING DATE: 1993-09-14 

PRIOR APPLICATION NUMBER: 07/911,684 

PRIOR FILING DATE: 1992-07-10 

PRIOR APPLICATION NUMBER: JP 3-172828 

PRIOR FILING DATE: 1991-07-12 

NUMBER OF SEQ ID NOS : 2 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 1 
LENGTH: 4105 
TYPE: DNA 

ORGANISM: Homo Sapiens 



FEATURE:
NAME/KEY: CDS
LOCATION: (485)...(1768)
NAME/KEY: sig_peptide
LOCATION: (485)...(544)
NAME/KEY: mat_peptide
LOCATION: (545)...(1768)



US-09-931-157-1 

Query Match 100.0%; Score 4105; DB 9; Length 4105; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 6 0 

I Ml M II MIMIMI II II I Ml I II IMM Mill 

GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 6 0 
AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 12 0 

IIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIMII 

AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 12 0 
CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 18 0 

I - I ! : ,'ih : I II i ■ M I 

CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180 
CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 24 0 

MIMM I II I MM M I MM MIMI IIMM 

CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240 
GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300 

1 1 1 1 1 i 1 1 1 1 ! I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300 
CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360 

M MM I I IMM'MI II I M III MM MM Mill II M 

CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360 
TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 42 0 

IMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIMIMIMI 
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Db 


61 


Qy 
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Qy 
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Db 
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Qy 
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Db 
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Qy 
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Db 
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Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Ov 


• 1141 


Db 


1141 


Qy 


1201 


Db 


1201 



361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 42 0 
AAAAAGTGAAGGTGTAAAAG CAGCACAAGTG CAATAAGAGATATTTC CTCAAATTTGCCT 4 8 0 

IMIIIMMIIIMIIMIIMIIMIMIIIMIIIII MIIMMMIMII III 

AAAAAGTGAAGGTGTAAAAG CAGCACAAGTG CAATAAGAGATATTTC CTCAAATTTGCCT 48 0 
CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 54 0 

ini i iii iii i iiiiii ii 1 ii iiiimi i in iimi : 

CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 54 0 
CAGTGATAAT CCTGAGAGATACAG CACAAATCTAAG CAATCATGTGGATGATTT CACCAC 600 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIII 

CAGTGATAATCCTGAGAGATACAG CACAAATCTAAG CAATCATGTGGATGATTT CACCAC 600 
TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIMIIMIIIMIIIIMIMI 

TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660 
ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIMIIIIIIMIIIIIIIIIMI 

ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780 

IIIIIIIIIIMIIIIIIIIIIMIIIIMIIIIIIIIIIIIMMIIIIIIIIIIIIII 

CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 78 0 
AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 84 0 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMMIIIIII 

AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 84 0 
AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900 

IIIIIIIIIIIIIIMIIIMIIIMIIIMMIIIIIIMIIIIIMIIIIIIIMIII 

AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900 
TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960 

IIIIIIMIIIIIIIIIIMIMIIIIIMIIIIIMIIIMIIIIIIIIIMIIIIIII 

TAAG CTGCTGG CTGGGCG CTGGC CTTTTGATCACAATGACTTTGGCGTATTT CTTTG CAA 960 



IIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIMIIIIIIIMIIMII 



TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080 

IIIIIIIIIIIIIIIIMMIIMIIMIMIIIIMIIIMIIIIIIIIIMIIIIIII 

TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 108 0 
TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 114 0 

M MUM IIIIII' i ! II, II Mill M III Mill; Ih II 

TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 114 0 
TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 12 00 

I Ml M,MMMI II I II II M M III MM I Ml II 

TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200 
TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

M I II ;,i , ,, 1 | , |, ' | 

TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
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Db 
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Db 
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1861 


Db 


1861 


Qy 


1921 


Db 
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1981 


Db 


1981 


Qy 


2041 


Db 


2041 



CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 132 0 

II li 1 1 : E 1 1 1 1 1 1 1 [ : [ 1 1 : 1 1 1 1 : 

CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 132 0 

TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 138 0 

I II IM Ml MM li M IMMMIM MMIIMM 

TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380 

GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 144 0 

M :i ! l l II ! II II I I 

GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 144 0 

GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500 

II II II II M I I IMIIIII I L li H|| Mill; |,|| 

GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500 

CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 

I II IM I I IMIIIIMI MM MM M IMMMIM 

CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 



IIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIII 



TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680 

IIIIIIIIIIIIIIIIIIIIIIIMIMMIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 168 0 
CATGAACGGAACAAGCATCCAGTGGAAGAAC 174 0 

II II Mi I I MilllMI II I III Ml llllllll 'I'll I 

CATGAACGGAACAAGCATCCAGTGGAAGAA 174 0 



IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIMII 



AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 

M MM II I II II IMMM M MM IIIIIIIMIMIM 

AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 



MM IMM! I MMI II MM! M llllllll MM I 



I M J 1 1 1 1 M M I M I M I J M M I M 1 1 M I M 1 1 1 M II M 1 1 M I M I M I M M M 



TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2 040 

M MMMMI M Ml IMM M I M !MMI II I I I 

TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 204 0 
AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100 

II M MUM MIMMMMMMIM MMI Mill IMIMM 

AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100 
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CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 

; ; i i , ! . i 1 1 , : : ; I ! : I 1 1 

CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 



IIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMMIIIIIIIMMIIIIIIIIMI 



2220 



ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280 

I Ml I MMM Ml II II M'MMII M IMM II I II 

ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280 
TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 

M Ml MM MMI II II II II II M' IM I II I II 

TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 
AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 24 00 

M M lUMIh I Mill; MM II II II Mi I II I II 

AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 24 00 



M M iM IMM II IM M il II II M II. I II I II 



TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 252 0 

MMI M II II M II MMMMMMM II Mill I M 

TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 252 0 
ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580 

! I i i 1 1 1 1 ! ' ! 1 1 I 

ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2 580 
TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

I I IM I Ml IMIM II Ml IIMMIMI I II III III MM 

TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2 64 0 
AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700 

M I IM I II IM II II II IM IMIM I Ml Ml III MM 

AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700 



I IM I IMIMMI II II Mi IMIM I II Ml IM MM 



MINI IIIIIIIIIIIMIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIMIII 



CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 288 0 

IIMIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 288 0 
TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2 94 0 

1 1 1 1 ' 1 1 1 1 1 1 Ml 1 1 1 1 1 1 II I II III I III III II 1 1 1 1 1 1 1 1 II 1 1 1 MM III 

TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2 94 0 
2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000
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II M, I Ml II I Ml IM II IM II M M IMMM 

GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000
TC^GTATGAACCTAACTCCCCACCCC^CATCTCCCTC 3060 

IMIIMIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

TCAGTATGAACCTAACTCCCCACCCC^ACATCTCCCTCCCACATTGTCACCATTTCAAAG 3 060 
GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120 

IM II I MUM I II III Mill MM III Mill II M 

GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120 
AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180 

II II M IMMM I II Ml IMM IM Ml I MM II I 

AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180 
ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240 

IMM M II II I IM II II M II Ml : IM II 

ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 
GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300 

I III MMI IMM IM II II M II Ml II II MM II 

GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 33 00 
GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 

II II M MM II M MM II III Mill II. MMI Ml Ml 

GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 33 60 
TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420 

M M i II Ml I II IMM I II III IMM. IMM 

TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420 
AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 34 80 

Mill MMI IM I IM II IMM IIMM M II IM I 

AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 34 80 
TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 

IIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540 
CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 

MIIMIIIIIIIIMMIIIIIIIIMMIIIIIIMIIIIIMIIIIMIIIIIIIII 

CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 
TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 

IIIIIIIIMIIIIIIIIIMIIIIIIIIIMIIIIIIMIIIIMIMIIIIIMIIII 

TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 
ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720 

II 1 1 1 IMM 1 1 1 1 1 1 1 1 1 1 M I II Ml MM I I II 1 1 1 1 III II II III 1 1 1 Ml II 

ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720 
CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780 

MIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIII 

CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780 
CATCATATG CAGTGATATATG CCTATAATATAAG CCATAGGTTCACACCATTTTGTTTAG 384 0 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 



Db 



3781 CATCATATG CAGTGATATATGCCTATAATATAAG CCATAGGTTCACACCATTTTGTTTAG 384 0 



Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


UD 


3 y d ± 


Qy 


4021 


Db 


4021 


Qy 


4081 


Db 


4081 



MINI II I II! Ill IIIIIIMIIIII III III MINI IIIIIIIMI III III III 

ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAj 
ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAI 

MINI III III III II I IIIIM Mill Ml IIIIIIIMI Ml Ml Ml 

ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAA( 
TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAAG 

I II I M M: I Ml II Mill MM I III II II I I M 

TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAAG 
AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTt 

MIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIII 

AGTAACTTTGTAGAAATGAG CCAGAAGCCAAGGCCCTGAGTTGG CAGTGG CC CATAAGT( 
TAAAATAAAAGTTTACAGAAACCTT 4105 

IIIIIMIIIIIIIIIMIIIIili 



RESULT 2 

US-10-007-926A-229 

Sequence 229, Application US/10007926A 
Publication No. US20030143539A1 
GENERAL INFORMATION: 
APPLICANT: BERTUCCI, FRANCOIS
APPLICANT: HOULGATTE, REMI 
APPLICANT: BIRNBAUM, DANIEL 
APPLICANT: NGUYEN, CATHERINE 
APPLICANT: VIENS, PATRICE
APPLICANT: FERT, VINCENT 

TITLE OF INVENTION: GENE EXPRESSION PROFILING OF PRIMARY BREAST CARCINOMAS 
TITLE OF INVENTION: USING ARRAYS OF CANDIDATE GENES 
FILE REFERENCE: 1546-R-00 

CURRENT APPLICATION NUMBER: US/10/007,926A
CURRENT FILING DATE: 2001-12-07 
PRIOR APPLICATION NUMBER: 60/254,090 
PRIOR FILING DATE: 2000-12-08 
NUMBER OF SEQ ID NOS: 468
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 229 
LENGTH: 4105 
TYPE: DNA

ORGANISM: Homo sapiens 
FEATURE:

OTHER INFORMATION: endothelin receptor type a (EDNRA) gene.
US-10-007-926A-229 

Query Match 100.0%; Score 4105; DB 13; Length 4105; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0;
Qy 1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



Db 



1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60 



Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db


721


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 



AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120 

llllllllll IIIIIIMIII IIIIIIMIIIIIIIIIIIIIIIIIIIIMII1IIIII! 

AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 12 0 
CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180 

IIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIMIIMIIMIIIMIIIIIIIIIII 

CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 18 0 
CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 24 0 

IIMIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMII 

CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 24 0 
GCAGTG CC CAGGAAGTTTTCTGAAGCCGGGGAAGCTGTG CAG CCGAAGCCG CCGC CG CGC 300 

M I Ml II II. II I Ml I II Mi I MM I Ml I M 

GCAGTG CCCAGGAAGTTTTCTGAAGC CGGGGAAGCTGTGCAG CCGAAGCCG CCGC CG CGC 3 00 
CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360 

IIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 

CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360 
TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420 

IIIIMIMIMIIIIMII llllllll IMIMII MINI llllllll MM 

TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420 
AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 78 0 

I I Ml Ml II IM I II I Ml IM I II I MIMM I 

CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 78 0 
AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 84 0 

MIMIIIIIIIIIIIMIIMIIIIIIMIIIIIIMIIIIIIMIIIIMIIIIIMI 

AACTCTG CTCAGGATCATTTAC CAGAACAAATGTATGAGGAATGG CC CCAACG CG CTGAT 84 0 
AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900 

M I Ml Ml II MIMI Ml I Ml MM IM I MMMi'i 

AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900 



Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 



IMM M M II M M M II I! MM Ml II M M MM 



GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020 

I II II: I. MM II I I IIMI I I: IMII I il Ml 

GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020 



MIMI M MMMIMM' MMl I! MM Ml II M! M MM 



TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 114 0 

I i Ml I M 1 1 1 1 ! 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 [ 1 1 1 1 ! 1 1 [ I 1 1 1 M 1 1 1 1 1 1 1 Mill 

TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 114 0 
TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 12 00 

IIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIII 

TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 12 00 
TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

IMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIII 

TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260 
CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320 

M II 1 M IMIIMI I I II II MM II III MM II II 

CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 132 0 



MIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIII 



GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 15 00 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIII 

GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 15 00 
CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 

IIMIIIIIMIMMIMMIIIIMIIMIIIIIIIIIIIIIIMIIMIIMIMI! 

C CGATGTGAATTACTTAGTTT CTTACTG CTCATGGATTACATCGGTATTAACTTGG CAAC 1560 
CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 168 0 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIMIIMIIIIMIMII 

TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680 
CATGAACGGAACAAGCATCC^GTGGAAGAACCACGATCAAAACAACCACAACAC^GACC^ 174 0 

MMIIIIIIIIIMIIMIIIIMIIIIIMIIIIIIIIIMIMIIMIIIIIIIIII 

CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAAC 174 0 



Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 



GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800 

IIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIII 

GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 18 00 
AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 

IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIMIMIIIIIIMII 

AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 
CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 192 0 

MIIIIIIIIIIIIIIIMIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 

CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 192 0 



MIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIII 



TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2 04 0 

Ml M M I ! l M I I II II II! ! M IM I II M 

TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2 04 0 
AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100 

I Ml II I' I 11 II! I I II II I I II I! M M MM 

AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100 
CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 

I II II II M !! Ill ! I II IM I I 'Mill M II III 

CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 
AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 222 0 

I Ml Ml I I MMIU M'MI II I I MMI II IMll! 

AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 222 0 
ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280 

I MMMMMMMMMMMMMMIMMIMM MIIMI MMIIIMI 

ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280 
TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 

MMI M : M. II II i I II II MM! II II MM II 

TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 
AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 24 00 

IMM IM MM Ml II M I II II M. I II M II lull 

AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 24 00 
ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460



MIMMIIIIIIN llllllllllll MM lllllll llllllllllll MM 



ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580 

' ; ^ : ' 1 I . ' 1 ' ' ! ' , I ' 

ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580 

2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 264 0 



Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 



II MMM II Ml II II IMM IMMI II M II II 

TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640 
AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700 

llilllllll IMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700
TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

IMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 

TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760 
ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820 

IMIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIMIIIIIIM 

ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820 
CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880 

MMM I MM IM M II II [Mil II IMM 1 IM I! M 

CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880 
TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2 94 0 

1 1 1 M 1 1 M 1 1 1 1 ! 1 1 1 1 1 1 1 1 ! I M 1 1 [[ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I 

TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 294 0 



IIIMIIIIIIIIIIIIIIIMIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIII 



TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060 

M II! MMM II M II II MMI II Ml II III II M 

TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060
GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120 

IIMIIMMIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120 

AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180 
! I J ' I I i I .11 ' I I : : I : I I II II I III if 1 1 ! j I I [ I ' . 

AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180 

ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 

MMIIII Mllllllll MIIIINMIIII IMIIIIIIIII lllllillll I 

ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 
GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

I II M Ml MMM Ml II II M IM !! ! II IM I 

GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 33 00 
GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 

IIIIIIIIIIIIIMIIIIIIIMIIIIIIMMIMIIIIIIIIIIIIMMIIIIMI 

GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 
TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420
AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

II M I IMMIMM II Ml Ml I II IM IMIM IM II 



Db 



3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480



Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 




Qy 


4021 


Db 


4021 


Qy 


4081 


Db 


4081 



TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 

III lllllll III IIIMIMIIIMI Ml MINI! I MIIIIIIMM lllllll II 

TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 
CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 

I MIMI I MMM M M I Mill MMIMM (I IMII M • 

CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 
TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 

MIMIIMIIIIMIIIMIIMIMIIIMIIMIIIIIIIIMIIIMIIIIIIIII 

TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 
ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 372 0 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIMIIMIIIIIMIIII 

ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 372 0 
CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 378 0 

IIIIIIIIIIIIIIIMIMIMMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 378 0 
CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 384 0 

IIIIIMIIIIIMIIIMIIIMIIIMIIIIIIIIIIIMIIIIIIIMIIIIIIIII 

CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 384 0 
ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900 

1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3 900 
ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3 960 

MIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIMIMIIIIIIIIMIIMIMII 

ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3 960 
TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 402 0 

1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j i j r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r i 

TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 402 0 
AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080



I 



RESULT 3 

US-10-101-510-370 

; Sequence 370, Application US/10101510 

; Publication No. US20030148295A1

; GENERAL INFORMATION: 

; APPLICANT: WAN, JACKSON 

; APPLICANT: WANG, YIXIN 

; TITLE OF INVENTION: EXPRESSION PROFILES AND METHODS OF USE 
; FILE REFERENCE: 15117.0012 

; CURRENT APPLICATION NUMBER: US/10/101,510



CURRENT FILING DATE: 2002-03-20 
PRIOR APPLICATION NUMBER: 60/276,947 
PRIOR FILING DATE: 2001-03-20 
NUMBER OF SEQ ID NOS: 805
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 370 
LENGTH: 4105 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-101-510-370 

Query Match 100.0%; Score 4105; DB 13; Length 4105; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60
AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 12 0 

IIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIMIIMMMIIIIIII 

AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 12 0 

CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180 

I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180 

CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 24 0 

IIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 

CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 24 0 
GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300 

Mllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300 
CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIMMMIIIIIMIIMI 

CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360 
TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 42 0 

IIMIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIII 

TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420 
AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

IM IM I II Ml I II II II M II Ml MiMI Ml M 

AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 4 80 
CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 54 0 

III Mil Ml II! I II II Ml MINIM MM Ml MM MM MINN MM III 

CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 54 0 
CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

IIIIIIIIIIMIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 



Qy 


1


Db


1


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 



Db 



601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660 



Qy 


661 


ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 

Ml M 1 Mill M 1 Ml 1 1 1 MIMI MMMIMI 

ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT


720 


Db 


661 


720 


Qy 


721 


CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 

IIIMIMIIIIIIIIMIIIIIIIMIIIIIIIIIMIIMIIIIIIIIIIIIIIIIM 

CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 


780 


Db 


721 


780 


Qy 
Db 


781 
781 


AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 

IIIIMIIIMMIIIIIIIIIIIIMIIIIIMIIIMIIIIMIIIIIIMIIIIIII 

AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 


840 
840 


Qy 


841 


AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIMIIIIIIIMIIIIIMIMIIMII 

AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 


900 


Db 


841 


900 


Qy 


901 


TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 

IIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 


960 


Db 


901 


960 


Qy 


961 


GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 

IIIMIIIIIIIIIMMIIIIIIMIIIMIIMIIIIIIIIIIIIIIIIIIIIIIMI 

GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 


1020 


Db 


961 


1020 


Qy 


1021 


TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 

IIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIMIMIIIIIIIIIMIIIIIIIIII 

TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 


1080 


Db 


1021 


1080 


Qy 


1081 


TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 

IIMIMIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIMIIIIIIIIIIIIIIIMM 

TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 


1140 


Db 


1081 


1140 


Qy 


1141 


TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 

i m mi ,i u n 'i l i nirii ii ii ii iiiiiM 

TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 


1200 


Db 


1141 


1200 


Qy 


1201 


TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 

IIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIMM 

TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 


1260 


Db 


1201 


1260 


Qy 


1261 


CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 

IM IM M 1 1 1 M 1 M M II II IMMMI MM 1 

CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 


1320 


Db 


1261 


1320 


Qy 


1321 


TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 

Ml IM 1 h IM 1 IM llllll II II IMI IM 

TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 


1380 


Db 


1321 


1380


Qy 


1381 


GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG


1440 


Db 


1381 


1440 


Qy 


1441 


GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 

M MM 1 M M 1 M Mill M IMM IMI IM M M 

GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 


1500 


Db 


1441 


1500 



Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 



CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 

I I I I ! : I I I I II I I I I I I IMIill III II I I I I I I I I I I I I I M I I I I I I I II I I I 
CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 

CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620 

II M MM IM II I II Ml M II MM IM M II M M 

CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620 

TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680 

I I I I I I I M i M I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I ! I I I I I I I I I i I 

TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680 

1681 CATGAACGGAACAAGCATCGAlGTGGAA 174 0 

IIIIIIIIIIIIIMIIIMIIMIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIII 

CATGAACGGAAC^GCATCCAGTGGAAGAACCACGATCAAA 174 0 

GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 18 00 

IIMIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIMIIMIIIIIIIII 

GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 18 00 
1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

IM I MM II M II II Ml II II II II I Ml M M 

AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 
CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 192 0 

IIIIIIIIIIIMIIMIIIMIIMIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIII 

CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920 
CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 198 0 

I M II II IMM Ml IMM II II MMM II II 

CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980 
TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 204 0 

MIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIM 

TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 204 0 
AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100 

MIIIMIIMIIIIMIIIIIIIIMIIIIMIIIIMIIMIIIIMIIMIIIMM 

AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100
CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 

M I II Ml II II Ml IM MM IM II II MM II Ml 

CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 
AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220 

M I Ml Ml IMM MM II M II II MUM II IM 

AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220 
ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280 

IMM II II II IM II II M IMIIIMM M IMMM ! 

ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280
TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 

I M II IMMIMI Ml Ml IMM II II I MM II IM 

TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 



Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy


3061


Db 


3061 


Qy 


3121 


Db 


3121 



AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 24 00 

I I Mi III I MIMMMMM Mill Ml lll M II IIMI 

AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 24 00 
ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

IIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 24 60 
TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2 520 

II I II II I II I II MM IM MMI M MIMM I 

TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2 52 0 
ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2 58 0 

MIIIIIIIIMMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIMIII 

ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580 
TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 264 0 

I IMM IM;! MIMM ll I IliMI M II M II II II 

TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 264 0 
AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2 700 

II II III II I M M MIMM II II M IMM MIMM II 

AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2 700 
TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760 

MIMIII IMIM MMM II II MIMI MMI Mi IM IIMI 

TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760
ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2 82 0 

I IIMI II I II Ml II II MMM M II Ml IM II II 

ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2 82 0 

CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880 



MMMM MMM II M M IIMI II I II II II MM 



GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000 

M II M M I Ml I MM II IMMI II IMMIIIIIMI 

GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000 
TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060 

MIIM I Ml M IM M II IMM Mill III II II IM [ 

TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3 060 



IIMIIIIIIIIIMIIIMIIIMIIMMIIIIMMIIIIIIMIIIIIIMIIII 

GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120
AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

M l! I MM M IM M II MMM MMM Ml II M Ml I 



Qy 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 



Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 



m M Mil MINI MM II II INMIMI IN! 

ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 
GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 33 00 

I Ml Nl I INI MINNIIN Mil Mil I II 

GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 33 00 
GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

Ml III MINI INN IMMIIIMM INI MUM 

GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 
TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420
AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480
TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 

I II MM | Ml MINIMI II IMM IININIM 

TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 



IIIIIMIIIIMIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMMIIIIII 



TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 

IIIIMIIIIIIMIIIIMIIIIIIIIIMIMIIIMIIIIIIIIIIMIIIIIIMI 

TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660
ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 372 0 

IIIIIIIIIMIIIIIIMIMIIIMIIIIIIIMIIMMIIIIIIMIIIIIIIIII 

ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 372 0 
CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780 

IIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIM 

CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780 
CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 384 0 

NUN I IMN INIMIMMII I IMINII II II I 

CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840
ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900
ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3 960 

UN ! II N II N Nl N IN I II NMI II IM 

ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3 960 
TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4 02 0 

Nl N illMI II N II N II N IIMMIMMI 

TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4 020 



||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

Qy 4081 TAAAATAAAAGTTTACAGAAACCTT 4105

|||||||||||||||||||||||||

Db 4081 TAAAATAAAAGTTTACAGAAACCTT 4105



RESULT 4 

US-10-225-567A-115 

Sequence 115, Application US/10225567A 
Publication No. US20030113798A1 
GENERAL INFORMATION: 
APPLICANT: Lifespan Biosciences 
APPLICANT: Brown, Joseph P. 
APPLICANT: Burmer, Glenna C. 
APPLICANT: Roush, Christine L. 

TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN-COUPLED
RECEPTORS (GPCRS) 

FILE REFERENCE: 1920-4-4 

CURRENT APPLICATION NUMBER: US/10/225,567A
CURRENT FILING DATE: 2001-12-19 
PRIOR APPLICATION NUMBER: 60/257,144 
PRIOR FILING DATE: 2000-12-19 
NUMBER OF SEQ ID NOS: 2292
SOFTWARE: PatentIn version 3.1
SEQ ID NO 115 
LENGTH: 4105 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-225-567A-115 



Query Match 100.0%; Score 4105; DB 15; Length 4105;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy 


1


Db


1


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 



GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Ih I II I MImII I II I II III I! II I Ml 'I I 

GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 6 0 
AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 12 0 

ilillllMIII IIIIIMI MINIM IIIIIIMIilllll IIIIIIINMI 

AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 12 0 
CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180 

IIIIIIMIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIMM! 

CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180 
CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 24 0 

MIIIIIIIIIIIMIIMMIIIIIIIIMIIMIIMIIIIIIIIMIIIIIIIIIII 

CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 24 0 



1 1 l II' II II 1 1 1 ■ l ' l f 1 1 1 ■ 1 1 1 1 ] i 1 1 1 1 j ' j I 1 . 1 1 1 . 



CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 

M I Ml II II II M, M MIMMMI, M III MMMM I 



360 



Db 



301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360



Qy        361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420

Qy        421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy        481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

Qy        541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy        601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy        661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy        721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy        781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy        841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy        901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy        961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy       1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy       1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

Qy       1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200


Qy       1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

Qy       1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

Qy       1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

Qy       1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

Qy       1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

Qy       1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

Qy       1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

Qy       1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

Qy       1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCAC 1740

Qy       1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

Qy       1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

Qy       1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

Qy       1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

Qy       1981
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1981



Qy       2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

Qy       2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160

Qy       2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

Qy       2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

Qy       2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340

Qy       2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

Qy       2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

Qy       2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

Qy       2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

Qy       2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

Qy       2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

Qy       2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

Qy       2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

Qy       2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880


Qy       2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

Qy       2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000

Qy       3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

Qy       3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

Qy       3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

Qy       3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

Qy       3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

Qy       3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

Qy       3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

Qy       3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

Qy       3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540

Qy       3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600

Qy       3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660

Qy       3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

Qy       3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780



Qy       3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

Qy       3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

Qy       3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960

Qy       3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

Qy       4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

Qy       4081 TAAAATAAAAGTTTACAGAAACCTT 4105
              |||||||||||||||||||||||||
Db       4081 TAAAATAAAAGTTTACAGAAACCTT 4105



RESULT 5 

US-09-919-497-12 

; Sequence 12, Application US/09919497 

; Patent No. US20020106662A1 

; GENERAL INFORMATION: 

; APPLICANT: Mutter, George L. 

; TITLE OF INVENTION: PROGNOSTIC CLASSIFICATION OF ENDOMETRIAL CANCER 
; FILE REFERENCE: B0801/7225 

; CURRENT APPLICATION NUMBER: US/09/919,497

; CURRENT FILING DATE: 2001-07-31 

; PRIOR APPLICATION NUMBER: US 60/221,735 

; PRIOR FILING DATE: 2000-07-31 

; NUMBER OF SEQ ID NOS: 100

; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 12

; LENGTH: 2595

; TYPE: DNA

; ORGANISM: Homo sapiens
US-09-919-497-12 

Query Match 60.2%; Score 2470; DB 10; Length 2595; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2481; Conservative 0; Mismatches 0; Indels 1; Gaps 1; 
Qy       1624 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 1683
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         15 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 74

Qy       1684 GAACGGAACAAGCATCCAGTGGAAGAACCA 1743
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         75 GAACGGAACAAGCATCCAGTC 134



Qy       1744 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 1803
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        135 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 194

Qy       1804
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        195 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 254

Qy       1864 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 1923
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        255 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 314

Qy       1924 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 1983
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        315

Qy       1984 TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 2043
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        375 TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 434

Qy       2044 TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 2103
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        435 TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 494

Qy       2104
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        495

Qy       2164 TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 2223
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        555 TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 614

Qy       2224
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        615

Qy       2284 TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 2343
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        675 TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 734

Qy       2344 AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 2403
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        735 AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 794

Qy       2404 CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 2463
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        795 CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 854

Qy       2464 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 2523
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        855 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 914

Qy       2524
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        915



Qy       2584 2643
              |||||||||||||||||||||||||||||| |||||||||||||||||||||||||||||
Db        975 AATCAGATAGTTCTTTTTCACAAGTTCAAT-CTGTTTTTCATGTAAATTTTGTATGAAAA 1033

Qy       2644
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1034

Qy       2704 GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 2763
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1094 GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 1153

Qy       2764 ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 2823
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1154 ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 1213

Qy       2824 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 2883
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1214 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 1273

Qy       2884 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 2943
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1274 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 1333

Qy       2944
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1334

Qy       3004 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 3063
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1394 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 1453

Qy       3064 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 3123
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1454 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 1513

Qy       3124 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 3183
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1514 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 1573

Qy       3184 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 3243
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1574 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 1633

Qy       3244 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 3303
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1634 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 1693

Qy       3304 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 3363
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1694 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 1753

Qy       3364 AGCAACCTTCTGCATTCATAAATCTTGT 3423
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1754 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 1813



Qy       3424 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 3483
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1814 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 1873

Qy       3484 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 3543
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1874 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 1933

Qy       3544 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 3603
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1934 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 1993

Qy       3604 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 3663
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1994 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 2053

Qy       3664 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 3723
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2054 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 2113

Qy       3724 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 3783
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2114 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 2173

Qy       3784 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 3843
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2174 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 2233

Qy       3844 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 3903
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2234 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 2293

Qy       3904 CAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAA 3963
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2294 CAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAA 2353

Qy       3964 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 4023
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2354 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 2413

Qy       4024 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4083
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2414 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 2473

Qy       4084 AATAAAAGTTTACAGAAACCTT 4105
              ||||||||||||||||||||||
Db       2474 AATAAAAGTTTACAGAAACCTT 2495



RESULT 6 

US-10-204-752-30 

; Sequence 30, Application US/10204752 
; Publication No. US20030152956A1 
; GENERAL INFORMATION: 

; APPLICANT: OHTANI, Noriko
; APPLICANT: MATSUI, Keiko
; APPLICANT: YOSHIDA, Nei
; APPLICANT: SUGITA, Yuji
; APPLICANT: IZUHARA, Kenji

TITLE OF INVENTION: METHOD OF TESTING FOR ALLERGIC DISEASE 
FILE REFERENCE: SHZ-009US 

CURRENT APPLICATION NUMBER: US/10/204,752
CURRENT FILING DATE: 2002-08-23 
PRIOR APPLICATION NUMBER: JP 2000-396166 
PRIOR FILING DATE: 2000-12-26 
NUMBER OF SEQ ID NOS: 37
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 30
LENGTH: 2595 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-204-752-30 

Query Match 60.2%; Score 2470; DB 13; Length 2595; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2481; Conservative 0; Mismatches 0; Indels 1; Gaps 1; 
Qy       1624 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 1683
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         15 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 74

Qy       1684 GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAG 1743
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         75 GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACC 134

Qy       1744 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 1803
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        135 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 194

Qy       1804 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 1863
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        195 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 254

Qy       1864 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 1923
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        255 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 314

Qy       1924 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 1983
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        315 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 374

Qy       1984 TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 2043
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        375 TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 434

Qy       2044 TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 2103
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        435 TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 494

Qy       2104 GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 2163
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        495 GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 554



Qy       2164 TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 2223
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        555 TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 614

Qy       2224 TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 2283
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        615 TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 674

Qy       2284 TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 2343
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        675 TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 734

Qy       2344 AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 2403
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        735 AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 794

Qy       2404 CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 2463
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        795 CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 854

Qy       2464 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 2523
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        855 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 914

Qy       2524 GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 2583
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        915 GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 974

Qy       2584 AATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAA 2643
              |||||||||||||||||||||||||||||| |||||||||||||||||||||||||||||
Db        975 AATCAGATAGTTCTTTTTCACAAGTTCAAT-CTGTTTTTCATGTAAATTTTGTATGAAAA 1033

Qy       2644 ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 2703
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1034 ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 1093

Qy       2704
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1094

Qy       2764
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1154

Qy       2824 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 2883
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1214 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 1273

Qy       2884 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 2943
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1274 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 1333

Qy       2944 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 3003
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1334 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 1393

Qy       3004 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 3063
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1394 GTATGAACCTAACTCCCCACCCCAAC 1453


Qy       3064 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 3123
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1454 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 1513

Qy       3124 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 3183
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1514 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 1573

Qy       3184 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 3243
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1574 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 1633

Qy       3244 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 3303
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1634 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 1693

Qy       3304 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 3363
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1694 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 1753

Qy       3364 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 3423
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1754 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 1813

Qy       3424 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 3483
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1814 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 1873

Qy       3484 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 3543
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1874 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 1933

Qy       3544 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 3603
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1934 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 1993

Qy       3604 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 3663
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1994 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 2053

Qy       3664 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 3723
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2054 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 2113

Qy       3724 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 3783
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2114 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 2173

Qy       3784 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 3843
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2174 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 2233

Qy       3844 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 3903
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2234 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 2293



Qy 3 904 CAGAAAGTCATAGATTTCTGAAGG CGTCAACGTG CATTTTATTTATGGACTGGTAAGTAA 3963 

llllll MMIIIII II INI lllllll 111 MM MM I II II II III MM II MM! 

Db 2294 CAGAAAGTCATAGATTTCTGAAGG CGTCAACGTG CATTTTATTTATGGACTGGTAAGTAA 2353 

Qy 3 964 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 4 023 

M II IMMIIMM II Mill IMM Ml I MUM MM MM 

Db 2354 CTGTGGTTTACTAGC^GGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 2413 

Qy 4 024 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4 083 

MIMIIIIMIIIIIIMIIIIIIIIIIMIIMIIIIIIIIIIIIIIIMIIIIMM 

Db 2414 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 2473 

Qy 4 084 AATAAAAGTTTACAGAAACCTT 4105 

IMIIIIIIIIIIIIIIIIIII 

Db 2474 AATAAAAGTTTACAGAAACCTT 24 95 



RESULT 7 

US-09-826-509-494 

Sequence 494, Application US/09826509 
Publication No. US20030204073A1 
GENERAL INFORMATION: 
APPLICANT: Lehmann-Bruinsma, Karin
APPLICANT: Liaw, Chen W.
APPLICANT: Lin, I-Lin
TITLE OF INVENTION: No. US20030204073A1-Endogenous, Constitutively Activated Known G
TITLE OF INVENTION: Protein-Coupled Receptors
FILE REFERENCE: AREN-207
CURRENT APPLICATION NUMBER: US/09/826,509
CURRENT FILING DATE: 2001-04-05
PRIOR APPLICATION NUMBER: 60/195,747
PRIOR FILING DATE: 2000-04-07
PRIOR APPLICATION NUMBER: 09/170,496
PRIOR FILING DATE: 1998-10-13
NUMBER OF SEQ ID NOS: 589
SOFTWARE: PatentIn Version 2.1
SEQ ID NO 494
LENGTH: 1284 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-826-509-494 

Query Match 31.2%; Score 1280.8; DB 13; Length 1284; 

Best Local Similarity 99.8%; Pred. No. 1.6e-308; 

Matches 1282; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy  485 ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 544
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 60

Qy  545 GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 604
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 120

Qy  605 CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 664
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 180

Qy  665 AGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAA 724
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 AGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAA 240

Qy  725 TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 784
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 300

Qy  785 CTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCC 844
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCC 360

Qy  845 AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 904
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 420

Qy  905 CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 964
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 480

Qy  965 TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 1024
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 540

Qy 1025 GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 1084
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 600

Qy 1085 GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 1144
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 660

Qy 1145 GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATG 1204
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATG 720

Qy 1205 CTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTC 1264
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTC 780

Qy 1265 GGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGT 1324
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 GGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGT 840

Qy 1325 GAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAG 1384
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 GAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAG 900

Qy 1385 CGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTC 1444
        ||||||||||||  ||||||||||||||||||||||||||||||||||||||||||||||
Db  901 CGTCGAGAAGTGAAAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTC 960

Qy 1445 CCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGA 1504
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 CCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGA 1020

Qy 1505 TGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATG 1564
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATG 1080

Qy 1565 AATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTC 1624
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 AATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTC 1140

Qy 1625 CAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATG 1684
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 CAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATG 1200

Qy 1685 AACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAGC 1744
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 AACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAGC 1260

Qy 1745 AGCCATAAGGACAGCATGAACTGA 1768
        ||||||||||||||||||||||||
Db 1261 AGCCATAAGGACAGCATGAACTGA 1284



RESULT 8 

US-10-101-510-59 

; Sequence 59, Application US/10101510

; Publication No. US20030148295A1 

; GENERAL INFORMATION: 

; APPLICANT: WAN, JACKSON 

; APPLICANT: WANG, YIXIN 

; TITLE OF INVENTION: EXPRESSION PROFILES AND METHODS OF USE 
; FILE REFERENCE: 15117.0012 

; CURRENT APPLICATION NUMBER: US/10/101,510

; CURRENT FILING DATE: 2002-03-20 

; PRIOR APPLICATION NUMBER: 60/276,947 

; PRIOR FILING DATE: 2001-03-20 

; NUMBER OF SEQ ID NOS: 805

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 59 

LENGTH: 560 

TYPE: DNA

ORGANISM: Homo sapiens 
US-10-101-510-59 



Query Match 12.7%; Score 521; DB 13; Length 560; 

Best Local Similarity 97.7%; Pred. No. 3.4e-119; 

Matches 550; Conservative 0; Mismatches 10; Indels 3; Gaps 2; 



Qy 2464 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 2523
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 60

Qy 2524 GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 2583
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 120

Qy 2584 AATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAA 2643
        I: MMI I M Ml II M lllllllllllllllllllllllllllll
Db  121

Qy 2644 ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 2703
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  180 ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 239

Qy 2704 GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 2763
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  240 GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 299

Qy 2764
        III II I MM II I I II II I I II IM | M II
Db  300

Qy 2824 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 2883
        |||||||||||||||||||| ||||||||||||||||||||||  |||||||||||||||
Db  360 CTCCTATTCTCTTAATTTTTCTTAAAATGTTAACTGGCAGTAAAGCTTTTTTGATCATTC 419

Qy 2884 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 2943
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  420 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 479

Qy 2944 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 3003
        |||| | |||||||||||| ||||||  |  | |||||||||||||||||||||||||||
Db  480 AAATTAATACCCACAAATGGCACCAG--AACTTACGATTCTTCACTTCTTGGGGTTTTCA 537

Qy 3004 GTATGAACCTAACTCCCCACCCC 3026
        |||||||||||||||||||||||
Db  538 GTATGAACCTAACTCCCCACCCC 560



RESULT 9 

US-09-918-995-8960 

; Sequence 8960, Application US/09918995 
; Publication No. US20030073623A1 
; GENERAL INFORMATION: 
; APPLICANT: Hyseq, Inc. 

; TITLE OF INVENTION: NOVEL NUCLEIC ACID SEQUENCES OBTAINED 
; TITLE OF INVENTION: FROM VARIOUS CDNA LIBRARIES 
; FILE REFERENCE: 20411-756 

; CURRENT APPLICATION NUMBER: US/09/918,995 
; CURRENT FILING DATE: 2001-07-30 

; PRIOR APPLICATION NUMBER: US/09/235,076
; PRIOR FILING DATE: 1999-01-20
; NUMBER OF SEQ ID NOS: 38054
; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 8960 

LENGTH: 451 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE:
NAME/KEY: misc_feature
LOCATION: (1)...(451)
OTHER INFORMATION: n = A,T,C or G 
US-09-918-995-8960 



Query Match 10.0%; Score 412.4; DB 11; Length 451; 

Best Local Similarity 98.6%; Pred. No. 3.6e-92; 

Matches 416; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 



Qy 3347 AGTCCATTTTAACCTGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATT 3406
        | | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||
Db   29 AATTCATTTTAACCTGTAGCAACCTTTTGCATTCATAAATCTTGTAATCATGTTACCATT 88

Qy 3407 ACAAATGGGATATAAGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGT 3466
        |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Db   89 ACAAATGGTATATAAGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGT 148

Qy 3467 TTTGTTTGGTTGGTTGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGG 3526
        |||||||||||||||||||||||||| |||||||||||||||||||| ||||||||||||
Db  149 TTTGTTTGGTTGGTTGGTTTGATAAAACAGTATTTGGGGTCATATTGATTCCTGTGCTGG 208

Qy 3527 AGCAAAAGTCATTACACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGA 3586
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  209 AGCAAAAGTCATTACACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGA 268

Qy 3587 TGAAATTGCCAGGTTGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATA 3646
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  269 TGAAATTGCCAGGTTGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATA 328

Qy 3647 AATTAGGTAAGATAATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCC 3706
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  329 AATTAGGTAAGATAATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCC 388

Qy 3707 AGTTGCTTGAATTGCAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTA 3766
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  389 AGTTGCTTGAATTGCAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTA 448

Qy 3767 TT 3768
        ||
Db  449 TT 450



RESULT 10 

US-10-007-926A-177 

Sequence 177, Application US/10007926A 
Publication No. US20030143539A1 
GENERAL INFORMATION: 
APPLICANT: BERTUCCI, FRANCOIS
APPLICANT: HOULGATTE, REMI
APPLICANT: BIRNBAUM, DANIEL
APPLICANT: NGUYEN, CATHERINE
APPLICANT: VIENS, PATRICE
APPLICANT: FERT, VINCENT 

TITLE OF INVENTION: GENE EXPRESSION PROFILING OF PRIMARY BREAST CARCINOMAS 
TITLE OF INVENTION: USING ARRAYS OF CANDIDATE GENES 
FILE REFERENCE: 1546-R-00 

CURRENT APPLICATION NUMBER: US/10/007,926A
CURRENT FILING DATE: 2001-12-07
PRIOR APPLICATION NUMBER: 60/254,090
PRIOR FILING DATE: 2000-12-08
NUMBER OF SEQ ID NOS: 468
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 177
LENGTH: 4286
TYPE: DNA
ORGANISM: Homo sapiens
FEATURE:

OTHER INFORMATION: endothelin receptor type b (EDNRB) gene. 
US-10-007-926A-177 

Query Match 9.6%; Score 395.8; DB 13; Length 4286; 

Best Local Similarity 65.7%; Pred. No. 2.6e-87; 

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3; 
Qy  689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748
        |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db  505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564

Qy  749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808
          | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db  565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624

Qy  809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868
        || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db  625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684

Qy  869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928
         |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db  685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 740

Qy  929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988
                    ||||| |      | || |||||| | || ||  | |||||  ||||
Db  741 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789

Qy  989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048
        ||||| ||||| || || |  || || ||||| ||| ||||||| ||  |||| |||||
Db  790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849

Qy 1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108
        || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db  850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909

Qy 1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168
        || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db  910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969

Qy 1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1222
         | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db  970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029

Qy 1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282
        |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db 1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089

Qy 1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342
        || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db 1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 1146

Qy 1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402
        ||  |  || ||   |||||  | | ||| || || |||||| | || |||||||| |||
Db 1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206

Qy 1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462
        || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db 1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266

Qy 1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522
        ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db 1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326

Qy 1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582
         |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db 1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386

Qy 1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642
        || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db 1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446

Qy 1643 TGCTG 1647
        || ||
Db 1447 TGGTG 1451





RESULT 11 
US-09-921-406C-13 

Sequence 13, Application US/09921406C 
Publication No. US20030152923A1 
GENERAL INFORMATION: 
APPLICANT: Yakhini, Zohar 
APPLICANT: Ben-Dor, Amir 
APPLICANT: Sampas, Nick 
APPLICANT: Dougherty, Edward 
APPLICANT: Trent, Jeff 
APPLICANT: Meltzer, Paul 
APPLICANT: Chen, Yidong 
APPLICANT: Weeraratna, Ashani 
APPLICANT: Jiang, Yuan 
APPLICANT: Bittner, Michael 
TITLE OF INVENTION: Classifying Cancers 
FILE REFERENCE: 10010313-1 

CURRENT APPLICATION NUMBER: US/09/921,406C
CURRENT FILING DATE: 2001-08-02
NUMBER OF SEQ ID NOS: 41
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 13 
LENGTH: 4286 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-921-406C-13 



Query Match 9.6%; Score 395.8; DB 13; Length 4286; 

Best Local Similarity 65.7%; Pred. No. 2.6e-87; 

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3;



Qy  689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748
        |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db  505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564

Qy  749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808
          | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db  565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624

Qy  809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868
        || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db  625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684

Qy  869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928
         |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db  685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 740

Qy  929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988
                    ||||| |      | || |||||| | || ||  | |||||  ||||
Db  741 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789

Qy  989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048
        ||||| ||||| || || |  || || ||||| ||| ||||||| ||  |||| |||||
Db  790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849

Qy 1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108
        || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db  850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909

Qy 1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168
        || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db  910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969

Qy 1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1222
         | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db  970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029

Qy 1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282
        |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db 1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089

Qy 1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342
        || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db 1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 1146

Qy 1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402
        ||  |  || ||   |||||  | | ||| || || |||||| | || |||||||| |||
Db 1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206

Qy 1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462
        || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db 1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266

Qy 1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522
        ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db 1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326

Qy 1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582
         |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db 1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386

Qy 1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642
        || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db 1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446

Qy 1643 TGCTG 1647
        || ||
Db 1447 TGGTG 1451



RESULT 12 
US-10-210-120-15 

; Sequence 15, Application US/10210120 

; Publication No. US20030175736A1 

; GENERAL INFORMATION: 

; APPLICANT: Chinnaiyan, Arul M.

; APPLICANT: Rubin, Mark A. 

; APPLICANT: Sreekumar, Arun 

; TITLE OF INVENTION: Expression Profile of Prostate Cancer 
; FILE REFERENCE: UM-07221

; CURRENT APPLICATION NUMBER: US/10/210,120

; CURRENT FILING DATE: 2002-08-01 

; PRIOR APPLICATION NUMBER: US 60/309,581 

; PRIOR FILING DATE: 2001-08-02 

; PRIOR APPLICATION NUMBER: US 60/334,468 

; PRIOR FILING DATE: 2001-11-15 

; NUMBER OF SEQ ID NOS: 123

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 15 

LENGTH: 4286 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-210-120-15 

Query Match 9.6%; Score 395.8; DB 13; Length 4286; 

Best Local Similarity 65.7%; Pred. No. 2.6e-87; 

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3;



Qy  689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748
        |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db  505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564

Qy  749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808
          | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db  565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624

Qy  809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868
        || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db  625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684

Qy  869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928
         |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db  685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 740

Qy  929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988
                    ||||| |      | || |||||| | || ||  | |||||  ||||
Db  741 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789

Qy  989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048
        ||||| ||||| || || |  || || ||||| ||| ||||||| ||  |||| |||||
Db  790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849

Qy 1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108
        || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db  850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909

Qy 1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168
        || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db  910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969

Qy 1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1222
         | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db  970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029

Qy 1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282
        |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db 1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089

Qy 1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342
        || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db 1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 1146

Qy 1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402
        ||  |  || ||   |||||  | | ||| || || |||||| | || |||||||| |||
Db 1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206

Qy 1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462
        || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db 1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266

Qy 1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522
        ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db 1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326

Qy 1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582
         |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db 1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386

Qy 1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642
        || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db 1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446

Qy 1643 TGCTG 1647
        || ||
Db 1447 TGGTG 1451



RESULT 13 

US-10-225-567A-113 

; Sequence 113, Application US/10225567A 
; Publication No. US20030113798A1 
; GENERAL INFORMATION: 
; APPLICANT: Lifespan Biosciences 
APPLICANT: Brown, Joseph P.
APPLICANT: Burmer, Glenna C.
APPLICANT: Roush, Christine L.
TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN-COUPLED RECEPTORS (GPCRS)
FILE REFERENCE: 1920-4-4
CURRENT APPLICATION NUMBER: US/10/225,567A
CURRENT FILING DATE: 2001-12-19
PRIOR APPLICATION NUMBER: 60/257,144
PRIOR FILING DATE: 2000-12-19
NUMBER OF SEQ ID NOS: 2292
SOFTWARE: PatentIn version 3.1
SEQ ID NO 113
LENGTH: 4286
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-225-567A-113 

Query Match 9.6%; Score 395.8; DB 15; Length 4286; 

Best Local Similarity 65.7%; Pred. No. 2.6e-87; 

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3; 
Qy  689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748
        |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db  505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564

Qy  749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808
          | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db  565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624

Qy  809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868
        || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db  625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684

Qy  869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928
         |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db  685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 740

Qy  929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988
                    ||||| |      | || |||||| | || ||  | |||||  ||||
Db  741 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789

Qy  989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048
        ||||| ||||| || || |  || || ||||| ||| ||||||| ||  |||| |||||
Db  790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849

Qy 1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108
        || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db  850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909

Qy 1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168
        || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db  910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969

Qy 1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1222
         | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db  970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029

Qy 1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282
        |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db 1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089

Qy 1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342
        || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db 1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 1146

Qy 1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402
        ||  |  || ||   |||||  | | ||| || || |||||| | || |||||||| |||
Db 1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206

Qy 1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462
        || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db 1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266

Qy 1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522
        ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db 1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326

Qy 1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582
         |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db 1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386

Qy 1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642
        || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db 1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446

Qy 1643 TGCTG 1647
        || ||
Db 1447 TGGTG 1451



RESULT 14 
US-09-931-157-2 

; Sequence 2, Application US/09931157 

; Patent No. US20020082414A1 

; GENERAL INFORMATION: 

; APPLICANT: Imura, Hiroo 

; APPLICANT: Nakao, Kazuwa 

; APPLICANT: Nakanishi, Shigetada 

; TITLE OF INVENTION: Human Endothelin Receptor 

; FILE REFERENCE: 299002032411 

; CURRENT APPLICATION NUMBER: US/09/931,157 

; CURRENT FILING DATE: 2001-10-15 

PRIOR APPLICATION NUMBER: 08/121,446 
; PRIOR FILING DATE: 1993-09-14 
; PRIOR APPLICATION NUMBER: 07/911,684 
; PRIOR FILING DATE: 1992-07-10 
; PRIOR APPLICATION NUMBER: JP 3-172828 
; PRIOR FILING DATE: 1991-07-12 
; NUMBER OF SEQ ID NOS : 2 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 2 

LENGTH: 4301

TYPE: DNA 

ORGANISM: Homo Sapiens 



FEATURE:
NAME/KEY: CDS
LOCATION: (238)...(1566)
US-09-931-157-2 

Query Match 9.6%; Score 395.8; DB 9; Length 4301; 

Best Local Similarity 65.7%; Pred. No. 2.6e-87; 

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3; 
Qy  689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748
        |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db  505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564

Qy  749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808
          | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db  565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624

Qy  809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868
        || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db  625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684

Qy  869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928
         |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db  685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 740

Qy  929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988
                    ||||| |      | || |||||| | || ||  | |||||  ||||
Db  741 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789

Qy  989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048
        ||||| ||||| || || |  || || ||||| ||| ||||||| ||  |||| |||||
Db  790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849

Qy 1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108
        || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db  850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909

Qy 1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168
        || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db  910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969

Qy 1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1222
         | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db  970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029

Qy 1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282
        |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db 1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089

Qy 1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342
        || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db 1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 1146

Qy 1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402
        ||  |  || ||   |||||  | | ||| || || |||||| | || |||||||| |||
Db 1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206

Qy 1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462
        || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db 1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266

Qy 1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522
        ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db 1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326

Qy 1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582
         |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db 1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386

Qy 1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642
        || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db 1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446

Qy 1643 TGCTG 1647
        || ||
Db 1447 TGGTG 1451



RESULT 15 
US-09-826-509-496 

; Sequence 496, Application US/09826509 

; Publication No. US20030204073A1 

; GENERAL INFORMATION: 

; APPLICANT: Lehmann-Bruinsma, Karin

; APPLICANT: Liaw, Chen W.

; APPLICANT: Lin, I-Lin

; TITLE OF INVENTION: No. US20030204073A1-Endogenous, Constitutively Activated Known G

; TITLE OF INVENTION: Protein-Coupled Receptors
; FILE REFERENCE: AREN-207

; CURRENT APPLICATION NUMBER: US/09/826,509

; CURRENT FILING DATE: 2001-04-05 

; PRIOR APPLICATION NUMBER: 60/195,747 

; PRIOR FILING DATE: 2000-04-07 

; PRIOR APPLICATION NUMBER: 09/170,496 

; PRIOR FILING DATE: 1998-10-13 

; NUMBER OF SEQ ID NOS: 589

; SOFTWARE: PatentIn Version 2.1

; SEQ ID NO 496

LENGTH: 1329

TYPE: DNA

ORGANISM: Homo sapiens 
US-09-826-509-496 

Query Match 9.6%; Score 392.6; DB 13; Length 1329; 

Best Local Similarity 65.5%; Pred. No. 6.9e-87; 

Matches 632; Conservative 0; Mismatches 309; Indels 24; Gaps 3; 
Qy  689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748
        |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db  268 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 327

Qy  749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808
          | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db  328 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 387

Qy  809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868
        || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db  388 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 447

Qy  869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928
         |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db  448 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 503

Qy  929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988
                    ||||| |      | || |||||| | || ||  | |||||  ||||
Db  504 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 552

Qy  989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048
        ||||| ||||| || || |  || || ||||| ||| ||||||| ||  ||||| |||||
Db  553 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 612

Qy 1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108
        || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db  613 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 672

Qy 1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168
        || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db  673 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 732

Qy 1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1222
         | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db  733 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 792

Qy 1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282
        |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db  793 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 852

Qy 1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342
        || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db  853 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 909

Qy 1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402
        ||  |  || ||   |||||  | | ||| || || |||||| | || ||||||   |||
Db  910 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGAAGAAA 969

Qy 1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462
        || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db  970 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1029

Qy 1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522
        ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db 1030 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1089

Qy 1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582
         |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db 1090 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1149

Qy 1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642
        || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db 1150 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1209

Qy 1643 TGCTG 1647
        || ||
Db 1210 TGGTG 1214
ALIGNMENTS



RESULT 1 
ABZ35259 

ID ABZ35259 standard; cDNA; 4105 BP. 
XX 

AC ABZ35259; 
XX 

DT 05-FEB-2003 (first entry) 
XX 

DE Human gene expression profile polynucleotide SEQ ID NO 370. 
XX 

KW Human; artery; endothelium; umbilical; vein; aorta; pulmonary artery; 

KW bronchial epithelium; prostate; muscle; lung fibroblast; osteoblast; 

KW tumour; microarray; genome mapping; antibiotic; antiviral; antifungal; 

KW gene expression; gene; ss. 
XX 

OS Homo sapiens. 
XX 

PN WO200274979-A2.
XX 

PD 26-SEP-2002. 
XX 

PF 20-MAR-2002; 2002WO-US08456.
XX 

PR 20-MAR-2001; 2001US-276947P.
XX 

PA (ORTH ) ORTHO CLINICAL DIAGNOSTICS INC. 
XX 

PI Wan J, Wang Y; 
XX 

DR WPI; 2002-740862/80. 
XX 

PT New gene expression profile generated from primary, endothelial,

PT epithelial, and muscle cell types, useful for identifying disease 

PT pathologies involving alterations of gene expression, e.g. cancer - 
XX 

PS Disclosure; Page 513-515; 850pp; English. 
XX 

CC The invention relates to a gene expression profile comprising one or more 

CC genes (ABZ34889-ABZ35692) and generated from a cell type. The cell type 

CC is a coronary artery endothelium, umbilical artery or vein endothelium, 

CC aortic endothelium, dermal microvascular endothelium, pulmonary artery 

CC endothelium, myometrium microvascular endothelium, keratinocyte 

CC epithelium, bronchial epithelium, mammary epithelium, prostate 

CC epithelium, renal cortical epithelium, renal proximal tubule epithelium, 

CC small airway epithelium, renal epithelium, umbilical artery smooth 

CC muscle, neonatal dermal fibroblast, pulmonary artery smooth muscle, 

CC dermal fibroblast, neural progenitor cells, skeletal muscle, astrocytes, 

CC aortic smooth muscle, mesangial cells, coronary artery smooth muscle, 

CC bronchial smooth muscle, uterine smooth muscle, lung fibroblast, 

CC osteoblasts or prostate stromal cell. The gene expression profile is used 

CC for determining the level of RNA expression for a sample, determining the 

CC phenotype of a cell and distinguishing cell types. The gene or a protein 

CC expression profile is useful in identifying disease pathologies 

CC involving alterations of gene expression. The assessment of expression 

CC profiles may provide meaningful information with respect to tumour type 

CC and stage, treatment methods, and prognosis. The gene or protein 



CC expression profile may also be used for creating microarrays. The

CC microarray is useful for genetic and physical mapping of genomes, DNA 

CC sequencing, genetic or medical diagnosis, genotyping of organisms, 

CC confirming cell or tissue identifications and in identifying promising 

CC antibiotics, antiviral or antifungal agents. 

XX 

SQ Sequence 4105 BP; 1138 A; 859 C; 845 G; 1263 T; 0 Other; 

Query Match 100.0%; Score 4105; DB 24; Length 4105; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Qy   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

Qy  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180

Qy  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240

Qy  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

Qy  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

Qy  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420

Qy  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

Qy  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

Qy 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

Qy 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

Qy 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

Qy 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACA 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACA 1740

Qy 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

Qy 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

Qy 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

Qy 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

Qy 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

Qy 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

Qy 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160

Qy 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

Qy 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

Qy 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340

Qy 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

Qy 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

Qy 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

Qy 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

Qy 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

Qy 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

Qy 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

Qy 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

Qy 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

Qy 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

Qy 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000

Qy 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

Qy 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

Qy 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

Qy 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

Qy 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

Qy 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

Qy 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

Qy 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

Qy 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540

Qy 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600

Qy 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660

Qy 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

Qy 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

Qy 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

Qy 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

Qy 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960

Qy 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

Qy 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

Qy 4081 TAAAATAAAAGTTTACAGAAACCTT 4105
        |||||||||||||||||||||||||
Db 4081 TAAAATAAAAGTTTACAGAAACCTT 4105

RESULT 2 
ABV94238 

ID ABV94238 Standard; cDNA; 4105 BP. 
XX 

AC ABV94238; 
XX 

DT 08-JAN-2003 (first entry) 
XX 

DE Breast carcinoma related nucleotide sequence SEQ ID NO:229. 
XX 

KW Human; breast carcinoma; cancer; tumour; cytostatic; anti-tumour; 

KW gene; ss. 

XX 

OS Homo sapiens.
XX 

PN WO200246467-A2.
XX 

PD 13-JUN-2002. 
XX 

PF 07-DEC-2001; 2001WO-IB02811.
XX

PR 08-DEC-2000; 2000US-254090P.

PR 07-DEC-2001; 2001US-0007926.
XX

PA (IPSO-) IPSOGEN.
XX

PI Bertucci F, Houlgatte R, Birnbaum D, Nguyen C, Viens P, Fert V;

XX

DR WPI; 2002-619023/66. 
XX 

PT Novel polynucleotide library useful in molecular characterization of a 

PT carcinoma, comprising a pool of polynucleotide sequences or its 

PT subsequences which are either underexpressed or overexpressed in tumor 

PT cells - 

XX 

PS Claim 1; Page 258-259; 401pp; English.
XX 

CC The present invention describes a polynucleotide library (I) useful in 

CC the molecular characterisation of a carcinoma, comprising a pool of 

CC polynucleotides or its subsequences which are either underexpressed or 

CC overexpressed in tumour cells, and correspond to any of the 

CC polynucleotide sequences chosen from the 468 sequences given in ABV94010 

CC to ABV94477. Also described: (1) a polynucleotide array (II) useful for 

CC the prognosis or diagnostic of tumour, comprising (I); and (2) detecting 

CC (M1) differentially expressed polynucleotide sequences which are

CC correlated with a cancer, involves obtaining a polynucleotide sample from

CC a patient, and reacting the polynucleotide sample obtained with a probe 

CC immobilised on a solid support, where the probe comprises any combination 

CC of the polynucleotide sequences of (I) or its expression products encoded 

CC by polynucleotide sequences of (I), and detecting the reaction product. 

CC (I) have cytostatic activities and can be used as anti-tumour agents. (I) 

CC is useful in molecular characterisation of a carcinoma. (I) and (II) are 

CC useful for the prognosis or diagnostic of tumour, in differentiating a 



CC normal cell from a cancer cell, detecting a hormone sensitive tumour 

CC cell, differentiating a tumour with lymph nodes from a tumour without 

CC lymph nodes, differentiating antracycline-sensitive tumours from 

CC antracycline-insensitive tumours, and classifying good and poor prognosis 

CC primary breast tumours. (I) is useful for large-scale molecular 

CC characterisation of breast cancer that help in prediction, prognosis and 

CC cancer treatment, and for detecting differentially expressed genes that 

CC correlated with a cancer. 

XX 

SQ Sequence 4105 BP; 1138 A; 859 C; 845 G; 1263 T; 0 other; 

Query Match 100.0%; Score 4105; DB 24; Length 4105; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60 

MM I II II I MM Ml II II III MM II I II III MM Mill I MINI Ml 1 1 Ml 

Db 1 GAATTCG CGG CCGC CT CTTGCGGTCC CAGAGTGGAGTGGAAGGTCTGGAG CTTTGGGAGG 60 

Qy 61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 12 0 

1 1 1 1 1 M ! 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 ! 1 1 ! I I i II II 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 

Db 61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 12 0 

Qy 121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180 

1 1 1 M I M I M M M I M I II M 1 1 M 1 1 M 1 1 M I M I M 1 1 II M II 1 1 1 M I II M I 

Db 121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180 

Qy 181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 24 0 

I M 1 1 II II I II II I II 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II M 1 1 1 M M 1 1 1 II I M 

Db 181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 24 0 

Qy 241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300 

M I M II I I I II I II I I II II I I I I I I I M I I I II I II I M I I II II I I I I I II I I II II 
Db 241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 3 00 

Qy 3 01 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 3 60 

M I M 1 1 II M I II II M II 1 1 1 II I II 1 1 1 1 1 1 M I II 1 1 M II I II I M 1 1 II II 1 1 1 

Db 301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360 

Qy 3 61 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 42 0 

I M 1 1 1 1 1 1 1 II II I II II II 1 1 1 1 M 1 1 II I II 1 1 II 1 1 II 1 1 1 II 1 1 M I II I M II I 

Db 361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 42 0 

Qy 421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAA 48 0 

i: IM Ml Ml Ml ■ II III MM M' III IM Ml MM 

Db 421 AAAAAGTGAAGGTGTAAAAG CAG CACAAGTG CAATAAGAGATATTTCCTCAAATTTG CCT 48 0 

Qy 481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 54 0 

MM IMMIM,MM II Mlh IM II MM Ml IM; M 

Db 481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 54 0 

Qy 541 CAGTGATAATCCTGAGAGATACAGCACAAATC 600 

M II MM Ml : Ml Ml MM II, Ml .lll-lll' II ; M 

Db 541 CAGTGATAATCCTGAGAGATAC^GCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600 

Qy 601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660 

MMIIIIIMIIIIIIIIMIIIIIIIIIIIIIMMMIIIMIIIIIIIIIIIMM 



Db 601 TTTTCGTGGC^CAGAGCTC^GCTTCCTGGTTACCACTC^TCAACCCACTAATTTGGTCCT 660 

Qy 661 ACCCAGCAATGGCTC^TGC^CAACTATTGCCGVCAGCAGAGTAAAATTACTTC^GCTTT 72 0 

I i I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I II I I I I I I II I I I I II I I I I I I I I 
Db 661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 72 0 

Qy 721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780 

IIIMIIIIIIIIIIMMIIIIIIIIIIirillllllllllllllllllMIIIIIIII 

Db - 721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780 

Qy 781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 84 0 

IIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 84 0 

Qy 841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900 

MIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIMIMII 

Db 841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900 

Qy 901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy 961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500



Qy 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

Qy 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

Qy 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

Qy 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

Qy 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

Qy 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

Qy 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

Qy 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

Qy 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

Qy 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

Qy 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160

Qy 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

Qy 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

Qy 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340



Qy 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

Qy 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

Qy 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

Qy 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

Qy 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

Qy 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

Qy 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

Qy 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

Qy 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

Qy 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

Qy 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000

Qy 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

Qy 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

Qy 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180



Qy 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

Qy 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

Qy 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

Qy 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

Qy 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

Qy 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540

Qy 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600

Qy 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660

Qy 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

Qy 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

Qy 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

Qy 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

Qy 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960

Qy 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

Qy 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

Qy 4081 TAAAATAAAAGTTTACAGAAACCTT 4105

        |||||||||||||||||||||||||

Db 4081 TAAAATAAAAGTTTACAGAAACCTT 4105



RESULT 3 
ABZ42662 

ID ABZ42662 standard; DNA; 4105 BP. 
XX 

AC ABZ42662; 
XX 

DT 04-MAR-2003 (first entry) 
XX 

DE Human endothelin A receptor nucleotide SEQ ID NO: 115. 
XX 

KW G protein-coupled receptor; GPCR; antigenic peptide; gene therapy; 

KW G protein-coupled receptor modulator; antibody; immune-related disease;

KW growth-related disease; cell regeneration-related disease; AIDS; cancer;

KW immunological-related cell proliferative disease; autoimmune disease;

KW Alzheimer's disease; atherosclerosis; infection; osteoarthritis; allergy; 

KW osteoporosis; cardiomyopathy; inflammation; Crohn's disease; diabetes; 

KW graft versus host disease; Parkinson's disease; multiple sclerosis; pain; 

KW psoriasis; anxiety; depression; schizophrenia; dementia; memory loss; 

KW mental retardation; epilepsy; asthma; tuberculosis; obesity; nausea; 

KW hypertension; hypotension; renal disorder; rheumatoid arthritis; trauma; 

KW ulcer; gene; ds.

XX 

OS Homo sapiens. 
XX 

PN WO200261087-A2.
XX 

PD 08-AUG-2002. 
XX 

PF 19-DEC-2001; 2001WO-US50107.
XX 

PR 19-DEC-2000; 2000US-257144P.
XX 

PA (LIFE-) LIFESPAN BIOSCIENCES INC. 
XX 

PI Burmer GC, Roush CL, Brown JP; 
XX 

DR WPI; 2003-046718/04. 

DR P-PSDB; ABP81816. 
XX 

PT New isolated antigenic peptides e.g., for G protein-coupled receptors 

PT (GPCR), useful for diagnosing and designing drugs for treating

PT conditions in which GPCRs are involved, e.g. AIDS, Alzheimer's disease, 

PT cancer or autoimmune diseases 

XX 

PS Disclosure; Fig 1; 523pp; English. 
XX 

CC The present invention describes antigenic peptides (I) comprising: 

CC (a) any one of 1601 sequences (see ABP82019 to ABP83619) of 12-24 amino 

CC acids. Also described: (1) an assay for the detection of a particular 

CC G protein-coupled receptor (GPCR) or a candidate polypeptide in a sample; 



CC and (2) an isolated antibody having high specificity and high affinity 

CC or avidity for a particular GPCR. (I) can be used as GPCR modulators and 

CC in gene therapy. The antigenic peptides for GPCRs are useful in detecting 

CC an antibody against a particular GPCR, and in the production of specific 

CC antibodies. The peptides and antibodies are also useful for detecting the 

CC presence or absence of corresponding GPCRs. The antigenic peptides for 

CC GPCRs and antibodies are useful for diagnosing and designing drugs for 

CC treating immune-related diseases, growth-related diseases, cell 

CC regeneration-related disease, immunological-related cell proliferative

CC diseases, or autoimmune diseases, e.g. AIDS, Alzheimer's disease,

CC atherosclerosis, bacterial, fungal, protozoan or viral infections, 

CC osteoarthritis, osteoporosis, cancer, cardiomyopathy, chronic and acute 

CC inflammation, allergies, Crohn's disease, diabetes, graft versus host 

CC disease, Parkinson's disease, multiple sclerosis, pain, psoriasis, 

CC anxiety, depression, schizophrenia, dementia, mental retardation, memory 

CC loss, epilepsy, asthma, tuberculosis, obesity, nausea, hypertension, 

CC hypotension, renal disorders, rheumatoid arthritis, trauma, ulcers, or 

CC any other disorder in which GPCRs are involved. The antibodies may be 

CC used in immunoassays and immunodiagnosis. ABZ42523 to ABZ42869 encode

CC GPCR proteins given in ABP81675 to ABP82018, which are used in the 

CC exemplification of the present invention. 

XX 

SQ Sequence 4105 BP; 1138 A; 859 C; 845 G; 1263 T; 0 other; 

Query Match 100.0%; Score 4105; DB 25; Length 4105;

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Qy 61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

Qy 121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180

Qy 181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240

Qy 241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

Qy 301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

Qy 361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420



Qy 421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy 481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

Qy 541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy 601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy 661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy 721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy 781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy 841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy 901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy 961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320



Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

Qy 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

Qy 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

Qy 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

Qy 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

Qy 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

Qy 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

Qy 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

Qy 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

Qy 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

Qy 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

Qy 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160


Qy 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

Qy 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

Qy 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340

Qy 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

Qy 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

Qy 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

Qy 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

Qy 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

Qy 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

Qy 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

Qy 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

Qy 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

Qy 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

Qy 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000



Qy 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

Qy 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

Qy 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

Qy 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

Qy 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

Qy 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

Qy 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

Qy 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

Qy 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540

Qy 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600

Qy 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660

Qy 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

Qy 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

Qy 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

Qy 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

Qy 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960

Qy 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

Qy 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

Qy 4081 TAAAATAAAAGTTTACAGAAACCTT 4105

        |||||||||||||||||||||||||

Db 4081 TAAAATAAAAGTTTACAGAAACCTT 4105



RESULT 4 
AAQ34583 

ID AAQ34583 standard; DNA; 4105 BP. 
XX 

AC AAQ34583; 
XX 

DT 25-MAR-2003 (updated) 

DT 11-MAY-1993 (first entry)

XX 

DE ETa receptor gene. 
XX 

KW Human; ETa; ETb; endothelin; receptor; transmembrane domain; N tail;

KW extracellular; cytoplasmic; C tail; post translational; bovine;

KW modification; ET-1 receptor; antagonist; circulatory system; ss. 
XX 

OS Homo sapiens.



XX 

FH Key Location/Qualifiers 

FT CDS 485..1768

FT /*tag= a

FT sig_peptide 485..544

FT /*tag= b

FT mat_peptide 545..1765

FT /*tag= c

FT polyA_signal 4084..4089

FT /*tag= d

FT misc_feature 1972..1976

FT /*tag= e

FT /function= Related with mRNA instability

FT misc_feature 2059..2063

FT /*tag= f

FT /function= Related with mRNA instability

FT misc_feature 2309..2313

FT /*tag= g

FT /function= Related with mRNA instability

FT misc_feature 2386..2390

FT /*tag= h

FT /function= Related with mRNA instability

FT misc_feature 2680..2684

FT /*tag= i

FT /function= Related with mRNA instability

FT misc_feature 3252..3256

FT /*tag= j

FT /function= Related with mRNA instability

FT misc_feature 3944..3948

FT /*tag= k

FT /function= Related with mRNA instability

XX 

PN EP522868-A1. 
XX 

PD 13-JAN-1993. 
XX 

PF 10-JUL-1992; 92EP-0306347.
XX 

PR 12-JUL-1991; 91JP-0172828.
XX 

PA (SHIO) SHIONOGI SEIYAKU KK.
XX 

PI Imura H, Nakanishi S, Nakao K; 
XX 

DR WPI; 1993-010677/02. 

DR P-PSDB; AAR30885. 
XX 

PT Human ETa and ETb endothelin receptors - for measuring endothelin 

PT and screening for endothelin antagonists 

XX 

PS Claim 6; Fig 1; 39pp; English.
XX 

CC The sequences given in AAQ34583-84 encode the human ETa and ETb

CC endothelin receptors respectively. ETa is a 427 amino acid protein 

CC with a molecular weight of 48,726. ETb comprises 442 amino acids and 

CC has a molecular weight of 49,629. ETa has a higher affinity for 

CC endothelin (ET)-1 and ET-2, whereas ETb has no selectivity for ET-1,

CC ET-2 or ET-3. The receptors each contain seven transmembrane domains 

CC and have an extracellular N tail and a cytoplasmic C tail. There are 

CC several potential sites for post translational modification, these

CC sites are identical to those of bovine ET-1 receptor. ETa cDNA is 

CC 91.2% homologous to bovine ET-1 receptor cDNA and ETb cDNA is 61.1% 

CC homologous to that of bovine ETa-receptor. The receptor proteins are

CC useful as reagents for measuring the amount of ET or screening for 

CC antagonists of the ET receptor when studying the circulatory system. 

CC (Updated on 25-MAR-2003 to correct PN field.) 
XX 

SQ Sequence 4105 BP; 1138 A; 860 C; 844 G; 1263 T; 0 other; 

Query Match 100.0%; Score 4103.4; DB 14; Length 4105; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4104; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy      1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Qy     61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

Qy    121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180
          ||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||||
Db    121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACCATTGTGGAGAGGCGGTGGAGAGGCTT 180

Qy    181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240

Qy    241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

Qy    301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

Qy    361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420

Qy    421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy    481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

Qy    541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy    601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy    661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy    721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy    781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy    841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy    901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy    961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy   1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy   1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

Qy   1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

Qy   1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

Qy   1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

Qy   1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

Qy   1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

Qy   1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

Qy   1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

Qy   1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

Qy   1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

Qy   1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

Qy   1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800



Qy   1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

Qy   1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

Qy   1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

Qy   1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

Qy   2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

Qy   2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160

Qy   2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

Qy   2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

Qy   2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340

Qy   2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

Qy   2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

Qy   2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

Qy   2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

Qy   2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640



Qy   2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

Qy   2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

Qy   2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

Qy   2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

Qy   2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

Qy   2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000

Qy   3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

Qy   3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

Qy   3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

Qy   3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

Qy   3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

Qy   3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

Qy   3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

Qy   3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480



Qy   3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540

Qy   3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600

Qy   3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660

Qy   3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

Qy   3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

Qy   3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

Qy   3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

Qy   3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960

Qy   3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

Qy   4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

Qy   4081 TAAAATAAAAGTTTACAGAAACCTT 4105
          |||||||||||||||||||||||||
Db   4081 TAAAATAAAAGTTTACAGAAACCTT 4105
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XX 
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XX 
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XX 
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XX 
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XX 
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XX 

PI Norberg LT, Andersson MK, Lindstrom PHR, Jonsson L;
XX 

DR WPI; 2000-318010/27. 
XX 

PT Assessing cardiovascular status in humans involves comparing test 

PT polymorphic pattern comprising polymorphic positions within genes 

PT encoding specific proteins, with reference polymorphic pattern
XX 

PS Disclosure; Page 125-126; 126pp; English.
XX 

CC The invention relates to a novel method of assessing the cardiovascular 

CC status in an individual and to newly identified polymorphisms in the 

CC genes encoding angiotensin-converting enzyme (ACE), angiotensin II

CC receptor type 1 (AT1) and type 2 (AT2), angiotensinogen (AGT), renin,

CC aldosterone synthase, endothelin receptor type A and beta-adrenergic 

CC receptors 1 and 2. The method comprises determining the sequence at one

CC or more polymorphic positions within these genes, and comparing the 

CC pattern of polymorphisms from the individual with a reference polymorphic 

CC pattern obtained from a population of individuals exhibiting a 

CC predetermined cardiovascular disease status. The polymorphic markers are 

CC useful for determining the predisposition of an individual to 

CC cardiovascular disorders such as myocardial infarction, unstable angina, 

CC hypertension, atherosclerosis and stroke. They are also useful for 

CC predicting the likely cardiovascular status of a patient given a 

CC treatment regimen comprising administration of cardiovascular drugs 

CC (e.g., ACE inhibitors, beta-adrenergic receptor antagonists (beta-

CC blockers) or calcium channel blockers). One or more polymorphic markers

CC provides a basis for predicting the outcome of a treatment regimen. 

CC Fragments of the genes comprising a polymorphic site may be used as 

CC primers and probes for detecting genetic polymorphisms or in molecular 

CC library arrays for high throughput screening. The genes, and the proteins 

CC they encode are useful in the screening of potential cardiovascular 

CC drugs. Determination of an individual's polymorphic pattern reduces or 

CC eliminates trial and error in selecting a treatment for a particular 

CC individual cardiovascular patient. It also provides the ability to 

CC eliminate patients from clinical trials who are predicted to be 

CC non-responsive, or at a risk for an adverse response, to a particular

CC treatment regimen. Adverse results in an early trial can be evaluated to 

CC identify polymorphic patterns so that the adverse results can be 

CC correlated with a sub-population of the test population, permitting 

CC exclusion of such sub-populations from the treatment group. Beneficial

CC drugs can be approved for use in the appropriate population, thereby 



CC decreasing the number of patients required for a clinical trial, which in 

CC turn decreases the duration and cost of such trials. The present 

CC sequence represents the human endothelin receptor type A gene 

CC coding region (GenBank S57498). The polymorphic sites identified are

CC 969C/T, 1005A/G, 1146A/G and 2485T/C. 

XX 

SQ Sequence 4105 BP; 1138 A; 859 C; 843 G; 1263 T; 2 other; 

Query Match 100.0%; Score 4103; DB 21; Length 4105; 

Best Local Similarity 100.0%; Pred. No. 0;

Matches 4103; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy      1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Qy     61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

Qy    121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180

Qy    181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240

Qy    241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

Qy    301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

Qy    361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420

Qy    421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy    481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

Qy    541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy    601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy    661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy    721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy    781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy    841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy    901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy    961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy   1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy   1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

Qy   1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

Qy   1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

Qy   1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

Qy   1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380
          ||||||||||||||||||||  ||||||||||||||||||||||||||||||||||||||
Db   1321 TTGTGAGATGTTGAACAGAANNAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

Qy   1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

Qy   1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

Qy   1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
Qy   1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

Qy   1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

Qy   1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

Qy   1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

Qy   1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

Qy   1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

Qy   1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

Qy   1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

Qy   2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

Qy   2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160

Qy   2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

Qy   2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

Qy   2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340
Qy   2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

Qy   2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

Qy   2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

Qy   2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

Qy   2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

Qy   2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

Qy   2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

Qy   2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

Qy   2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

Qy   2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

Qy   2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000

Qy   3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

Qy   3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

Qy   3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180
ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIMIIIMIII 

ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 
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GGTATGTATGGATTTAATCTAAT CTAATAATTGTG CCCCG CAGTTGTGCCAAAGTGCATA 33 00 

IIIIIIIIIIIIIMIIIIIMI IMIMII IIIMIIIIIMMMIIIIIIM II 

GGTATGTATGGATTTAATCTAAT CTAATAATTGTG CCCCG CAGTTGTG CCAAAGTG CATA 33 00 
GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 

IIIIIIIIMIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIMIMIIII 

GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 
TGTAGCAACCTTCTGCATTCATAAATCTTGTAATC^^ 342 0 

MM IMIMII MIMMMIMMIM MMMMMIMM IMMIIIM 

TGTAGC^CCTTCTGCATTCATAAAT.CTTGTAATCATGTTACCATTACAAATGGGATATA 342 0 
AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 34 80 

I ! ' I I' i M I 

AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 34 80 
TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 

IIIIIIIIIMIMIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIII 

TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 
CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 
TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 
ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720 
CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780 
CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840 
ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900 
ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960 
TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020 
AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080 



Qy 4081 TAAAATAAAAGTTTACAGAAACCTT 4105 

|||||||||||||||||||||||||

Db 4081 TAAAATAAAAGTTTACAGAAACCTT 4105 



RESULT 6 
ACA56659 

ID ACA56659 standard; cDNA; 4079 BP. 
XX 

AC ACA56659; 
XX 

DT 06-JUN-2003 (first entry) 
XX 

DE Human signalling pathway polynucleotide probe SEQ ID NO 1257. 
XX 

KW Human; probe; ss; array element; Parkinson's disease; 

KW signalling pathway population; cancer; adenocarcinoma; leukaemia; 

KW immunopathy; AIDS; asthma; neuropathy; Alzheimer's disease; microarray. 

XX 

OS Homo sapiens. 
XX 

PN US6500938-B1. 
XX 

PD 31-DEC-2002. 
XX 

PF 30-JAN-1998; 98US-0016434. 
XX 

PR 30-JAN-1998; 98US-0016434. 
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Au-Young J, Seilhamer JJ; 
XX 

DR WPI; 2003-352189/33. 
XX 

PT Combination of polynucleotide probes, useful as array elements in a 

PT microarray for monitoring the expression of a number of target 

PT polynucleotides 
XX 

PS Claim 1; SEQ ID NO 1257; 65pp; English. 
XX 

CC The invention relates to a combination which comprises a number of 

CC polynucleotide probes comprising a sequence selected from one of the 1490 

CC sequences mentioned in the specification. The combination is useful as an 

CC array element in a microarray for monitoring the expression of a number 

CC of target polynucleotides. The microarray is particularly useful in the 

CC diagnosis and treatment of cancer and immunopathology and neuropathology. 

CC The microarray is useful in diagnostics and treatment regimens, drug 

CC discovery and development, toxicological and carcinogenicity studies, 

CC forensics and pharmacogenomics. The microarray is also useful for 

CC monitoring progression of diseases and for developing sophisticated 

CC profiles for the effects of currently available therapeutic drugs. The 

CC combination is also useful for purifying a subpopulation of mRNAs, cDNAs 

CC and genomic fragments and in research and diagnostic applications. The 

CC array can detect changes in expression in a large number of genes coding 

CC for different signaling pathway populations which can be used to diagnose 

CC various diseases including cancer e.g. adenocarcinoma and leukaemia, 

CC immunopathies e.g. AIDS and asthma, neuropathies e.g. Alzheimer's disease 



CC and Parkinson's disease. The present sequence represents a polynucleotide 

CC probe of the invention. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification but was obtained in electronic format directly from USPTO 

CC at seqdata.uspto.gov/sequence.html?DocID=06500938B1. 
XX 

SQ Sequence 4079 BP; 1125 A; 856 C; 842 G; 1256 T; 0 other; 

Query Match 99.4%; Score 4079; DB 25; Length 4079; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4079; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

TCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGGAGAC 64 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGGAGAC 60 
GGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCTCGCG 124 

i 1 1 1 1 1 1 : i M II I 1 1 1 III I II I! 1 1 1 1 1 II 1 1 1 1 li 1 1 1 Mill 1 1 1 III 1 1 1 1 1 I i 

GGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCTCGCG 12 0 
CGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATC 184 

IIIIMIIIIIIIIIIIMMIIIIIIIIIIMIMIIIIMIMIIIIIMIIIIIIII 

CGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATC 180 

CATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAG 244 

I I M I I I I I I I I I I I I I I I I I I I I I I M I i I I I I I I I I I I I i I I I I I I I I I I I I I I I I 

CATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAG 24 0 

TGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGA 304 

IIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIII 

TGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGA 300 
GCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGC 364 

IIIIMIIIIIIIIIIIIIIIIMIIIIIIMIIIIMIIIIIIIMIIIIIIIIMIII 

GCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGC 360 
CCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAA 424 

IMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIII 

CCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAA 42 0 
AGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAG 484 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAG 480 
ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 544 

II MMI II Mill II I Mill M: IMII I Ml II I 

ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 54 0 
GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 604 

Mllllllll IIMIMIIIIIIIIIIIIIIMIIIMII IIIIIIMIIIIIIIIMI 

GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 600 
CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 664 

I II 'Mill II li;ll II IIIMIIIII Mill M III lll'll I 

CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 660 
665 AGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAA 724 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAA 720 

TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 784 

I I I I I I I I I I I I I I I I I I I 111 I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 780 

CTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCC 844 

I I I I I I I I I I I I I I I I I I I II I I I I I II II I I I I! I I I I II I I II I I I I II I I I I I I I I I 
CTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCC 84 0 

AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 904 

IIIIIMIIIIIIIIIIIIIIIIIMIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIII 

AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 900 
CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 964 

II II II' M II !! II MM IMII I II II .1 IMM 

CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 960 
TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 1024 

IIMMIMIIMIIIIMIIIIMIMIMIIMIIMIIIMIIIIIIIIIIIIIIII 

TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 1020 
GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 1084 

IIIMIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIMIIIIIIIIIIMIIII 

GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 1080 
GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 1144 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 

GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 114 0 
GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATG 12 04 

IIIMIIIIIIIIIIIIMIIIIIMIMMIIIMIMIMIIIIIIIIIIIIIIIMI 

GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATG 12 00 



MM I MIMI MMll II I II M I Ml III M II II M 



GGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGT 1324 

Ml II I! II II I MMI M II MM Mill M MIMI 

GGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGT 1320 
GAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAG 1384 

IMM M IIMM I MMI I'M! M M II M I MMI 

GAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAG 1380 



I MM III Ml IIIIIM IIIIIIIIIMII lllllll II I INI Ml Ml MM Mill 



CCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGA 1504 

IIIMIIIIIIMIIIIMIMIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIMIII 

CCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGA 1500 
TGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATG 1564 

MMI Ml II IMM, MMI I II I. M III M MMM MM 
1501 TGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATG 1560 
AATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTC 1624 

Nil MMII M III I IMM II MMM II I MM MM 

AATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTC 162 0 
CAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATG 1684 

I I MMM MM MM I Ml MM MM II I! IN 

CAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATG 1680 
AACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAGC 1744 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAGC 1740 
AGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATC 18 04 

I ill MIMM M IM M II Ml IM II MMII II II Ml M 

AGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATC 18 00 
CTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTC 1864 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIMIIIIII 

CTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTC 1860 
TTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTGG 1924 

I II II IM I IM M II III III II M IN MMM III 

TTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTGG 1920 
TTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCT 1984 

M IMM M I II I MM MMI MM I II MIMM II I 

TTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCT 1980 
GCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAAT 2044 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAAT 2040 
GAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAG 2104 

II II I II I II I II' .11 

GAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAG 2100 
CTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGAT 2164 

! ! : : ! 1 ^ . ■ ! 1 I i I ' 1 i III. I i , 

CTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGAT 2160 
TAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATAT 2224 

MM N II MM! I N II N II I II MIMM 

TAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATAT 222 0 
GGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAAT 2284 

I Mi I MMM MMIMIMill M II M II IM M IM I 

GGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAAT 228 0 
AGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAATA 2344 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAATA 2340 
GTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACAC 24 04 

IIIIIIIIIIMIIIIIIIIIIIIIIIMMIIIIIIIIMIMIIIIIMIIIMIIII 

GTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACAC 24 00 
AAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTTG 2464 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTTG 2460 
GCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGG 2 524 

I M II r II I II; J II <i mi ii: I ii h mum I 

GCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGG 252 0 
TGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCA 2584 

illMIIIIIIIIII llilll MINI lllllllllil IIIIIMMMIIII 

TGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCA 2 58 0 
ATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAAA 2 644 

M, II i II I MM ll II M II III I II M II Ml I 

ATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAAA 264 0 



M I IM M II I II I MMI II IM I IM M II MMI 



TGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATGA 2764 

I Ml I Ml M II M II M II II I MM I Ml I I 

TGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATGA 2760 



! M M IM Mill M II M II II I ill 1 I Ml I I 



TCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCC 28 84 

I M M Ml M III M II IMIIMI III II II MM I 

TCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCC 288 0 
CTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAA 2944 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAA 2940 
AATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCAG 3004 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCAG 3000 
TATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGCC 3064 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGCC 3060 
CACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAA 3124 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAA 3120 
TCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTT 3184 

I Mill M II M IMM II MMM I IMMIIMI II I I 

TCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTT 318 0 



M II II II IMMMIM I MMM III I M II Mill MMI 
TGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCT 3304 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCT 3300 
GAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTA 3364 

IIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIMIIIIIMMIIIIIIIMII 

GAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTA 3360 
GCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAG 3424 

I MMM IM I M M II II II Ml M Ml M. II Ml I 

GCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAG 342 0 
GCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGT 3484 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGT 3480 
TTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACT 3544 

M MMI IMM MMM Mini IM IMM II I MMM I 

TTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACT 354 0 
TTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTC 3 604 

II II II II M II I Ml II MMM I MMMI II II M 

TTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTC 3600 
TGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTT 3664 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTT 3660 
GTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAG 3724 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAG 3720 
GCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATC 3784 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATC 3780 
ATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACAA 3844 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACAA 3840 
TTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTC 3904 

M II I Ml I MM I Ml II I II i,MI MMM IMM 

TTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTC 3900 



M II M II M M M : M:MiMIMMMI III II I Ml I 



TGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTA 4 024 

I II Ml II I II M IMMI'M 1 IM M IM II M Ml I 

TGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTA 4020 



I II MMM M M !l IMMM'M MMMI IM IM I Ml M 



RESULT 7 
AAL39858 

ID AAL39858 standard; DNA; 2595 BP. 
XX 

AC AAL39858; 
XX 

DT 05-SEP-2002 (first entry) 
XX 

DE Human allergy-associated gene SEQ ID No 30. 
XX 

KW Antiallergic; allergic disease; carboxypeptidase M; cathepsin C; CYP1B1; 

KW endoserine A receptor; osteoblast-specific factor 2; DD96; gene therapy; 

KW bronchial asthma; human; ds. 
XX 

OS Homo sapiens. 
XX 

PN WO200252006-A1. 
XX 

PD 04-JUL-2002. 
XX 

PF 21-DEC-2001; 2001WO-JP11287. 
XX 

PR 26-DEC-2000; 2000JP-0396166. 
XX 

PA (GENO-) GENOX RES INC. 
XX 

PI Ohtani N, Matsui K, Yoshida N, Sugita Y, Izuhara K; 
XX 

DR WPI; 2002-500763/53. 
XX 

PT Examining allergic diseases by changes in expression levels of six 

PT allergy-associated genes inducible by stimulation e.g. of airway 

PT epithelia cells with interleukin-4 or 13, also applicable in screening 

PT compounds 

XX 

PS Disclosure; Page 79-80; 106pp; Japanese. 
XX 

CC The invention relates to a novel method for examining allergic diseases, 

CC comprising determining the expression level of a gene selected from 

CC carboxypeptidase M, cathepsin C, endoserine A receptor, osteoblast- 

CC specific factor 2, DD96 and CYP1B1 in the biological sample from a 

CC patient, and comparing the expression level with that in the sample of a 

CC healthy individual. The polynucleotides of the invention can be used to 

CC treat disorders by gene therapy. The method is useful for examining 

CC allergic diseases particularly bronchial asthma and its diagnosis, which 

CC is also applicable in screening candidate compounds for remedies. This 

CC polynucleotide sequence represents a human allergy-associated gene of the 

CC invention. 

XX 

SQ Sequence 2595 BP; 788 A; 478 C; 459 G; 870 T; 0 other; 

Query Match 60.2%; Score 2470; DB 24; Length 2595; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2481; Conservative 0; Mismatches 0; Indels 1; Gaps 1; 
Qy 1624 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 1683 

IMlilllMlllMMllillllllllllllllliiiiiiiiiiMlllllMIIIIII 
Db 15 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 74 

GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAG 1743 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAG 134 

CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 1803 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIMIMIMMIMIIMIIIII 

CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 194 
CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 1863 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 254 
CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 1923 

IIIIIIIIIMMIMMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMMI 

CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 314 



I M, II I II MMM I IM I III I II !M M III M II 



TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 2043 

I Mi II I M MMM MMM I Ml M MIIIMM IMM II 

TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 434 
TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 2103 

I II I II I ll MMI I II M, II II I Ml M I 

TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 494 
GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 2163 

IIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIII 

GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 554 



IMIIIMI IIIIMIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIMIIIMII 



TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 2283 

MMM ! II II IM I III I MM I II III M MMM II 

TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 674 
TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 2343 

II II III MM I II II III III Ml III II III MM I II II III II II Ml MM I III 

TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 734 
AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 2403 

I Ml M M II II II I II Ml II Mill II II MM MM 

AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 794 

CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 24 63 

II II I I I I I II I I I II I I I II M I II II II II II I I I II I I I II I I I II II I I I M I II I 
CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 854 

GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 2523 

II MMM II M Mh,MMl I IMM II. M M M Mill 

GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 914 
GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 2583 

I . : 1 ill 1 ' 1 1 1 1 : 1 1 ' ! M 1 1 ! 1 1 

GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 974 
AATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAA 2643 

Mill MINI I II II I II lllllll III IIIIIIIIIMEIIIMIIIIIIIII III 

AATCAGATAGTTCTTTTTCACAAGTTCAAT-CTGTTTTTCATGTAAATTTTGTATGAAAA 1033 
ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 2703 

IIMIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIII 

ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 1093 

GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 2763 

I I I I M I I I I I I I I I . II I I I I I I II I II I I li II I I I M I I II I I M I I I II I I I 
GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 1153 

ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 2823 

I II IM I MUM MIMI MMII IMM I II II IM I! I 

ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 1213 
CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 2883 

I I Ml MMI! 1 II M II II II III II III MMIMI Ml 

CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 1273 
CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 2943 

M I Ml <M II I II I II I! Ml ill II III Mill II II 

CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 1333 
AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 3003 

I M M I II M IM Mill II Ml II Ml I! I II I M 

AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 1393 
GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 3063 

MIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 1453 
CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 3123 

IIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIM 

CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 1513 
ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 3183 

I Ml II I IMM III II II I II II Ml II I II I I 

ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 1573 
TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 3243 

MMI MMI I I! M IMM IMM' II IMMI MMIMI 

TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 1633 



M II M, I II I II I II I M M Ml IM II IM I II 



TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 3363 

MMI IM IMM I II I II M II M III Ml II IM I II 

TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 1753 



Qy 3364 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1754 

Qy 3424 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 3483 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1814 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 1873 

Qy 3484 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 3543 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1874 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 1933 

Qy 3544 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 3603 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1934 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 1993 

Qy 3604 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 3663 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1994 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 2053 

Qy 3664 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 3723 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2054 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 2113 

Qy 3724 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 3783 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2114 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 2173 

Qy 3784 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 3843 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2174 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 2233 

Qy 3844 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 3903 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2234 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 2293 

Qy 3904 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2294 

Qy 3964 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 4023 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2354 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 2413 

Qy 4024 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4083 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2414 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 2473 

Qy 4084 

||||||||||||||||||||||

Db 2474 



RESULT 8 
ABK94408 

ID ABK94408 standard; DNA; 2595 BP. 
XX 



AC ABK94408; 
XX 

DT 27-AUG-2002 (first entry) 
XX 

DE DNA encoding endothelin receptor A (EDNRA), exon 8. 
XX 

KW Endothelin; EDN; endothelin converting enzyme; ECE; EDNRA; 

KW endothelin receptor A; signaling system; cardiovascular disease; 

KW coronary heart disease; hypertension; atherosclerosis; angiogenesis; 

KW fatty acid metabolism; diabetes; familial hypercholesterolemia; 

KW forensic marker; transgenic animal; solid support; SNP; 

KW cardiovascular regulator; gene; ds; single nucleotide polymorphism. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT variation replace(1830,G) 

FT /*tag= a 

FT /standard_name= "Single nucleotide polymorphism" 
XX 

PN WO200224747-A2. 
XX 

PD 28-MAR-2002. 
XX 

PF 31-AUG-2001; 2001WO-EP10087. 
XX 

PR 19-SEP-2000; 2000EP-0120123. 
XX 

PA (EPID-) EPIDAUROS BIOTECHNOLOGIE AG. 
XX 

PI Brinkmann U, Hoffmeyer S; 
XX 

DR WPI; 2002-435060/46. 
XX 

PT Novel polynucleotide of the endothelin/endothelin converting 

PT enzyme/receptors of endothelin and endothelin converting enzyme 

PT signaling system associated with cardiovascular disease, useful for 

PT treating the disease 

XX 

PS Claim 1; Page -; 190pp; English. 
XX 

CC The invention describes a polynucleotide (I) of the endothelin 

CC (EDN)/endothelin converting enzyme (ECE)/receptors of EDN and ECE (EDNR) 

CC signaling system which is associated with a cardiovascular disease. (I), 

CC the gene encoding EDN, ECE or EDNR (II) or a vector (III) expressing (I) 

CC or (II) is useful for producing cells capable of expressing a molecular 

CC variant polypeptide which is associated with a cardiovascular disease. 

CC (II), (III), the EDN, ECE or EDNR polypeptide, or a cell expressing 

CC a molecular variant gene comprising (I) is useful for identifying and 

CC obtaining a pro-drug or drug capable of modulating the activity of a 

CC molecular variant of a polypeptide of the EDN/EDNR/ECE signaling system 

CC or its gene product, or for identifying and obtaining an inhibitor of 

CC the activity of a molecular variant of a polypeptide of the EDN/EDNR/ECE 

CC signaling system or its gene product. The isolated proteins and 

CC polynucleotides encoding them are useful for preparation of a 

CC pharmaceutical composition for treating a cardiovascular disease such as 

CC coronary heart disease, hypertension, atherosclerosis, or related to 



CC abnormal angiogenesis or fatty acid metabolism e.g. diabetes and familial 

CC hypercholesterolaemia. The gene or a polynucleotide fragment of the 

CC EDN/ECE/EDNR signaling system are useful as forensic markers, for 

CC creating a transgenic animal and in creation of a solid support 

CC comprising polynucleotides, genes, vectors, polypeptides, antibodies or 

CC host cells of the invention. This sequence encodes the cardiovascular 

CC regulator Endothelin receptor A (EDNRA). 

CC Note: This sequence does not appear in the specification but has been 

CC obtained from GenBank using information given in the invention. 

XX 

SQ Sequence 2595 BP; 788 A; 478 C; 459 G; 870 T; 0 other; 

Query Match 60.2%; Score 2470; DB 24; Length 2595; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2481; Conservative 0; Mismatches 0; Indels 1; Gaps 1; 
Qy 1624 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 1683 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 15 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 74 

Qy 1684 GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAG 1743 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 75 GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAG 134 

Qy 1744 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 1803 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 135 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 194 

Qy 1804 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 1863 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 195 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 254 

Qy 1864 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 1923 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 255 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 314 

Qy 1924 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 1983 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 315 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 374 

Qy 1984 TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 2043 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 375 TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 434 

Qy 2044 TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 2103 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 435 TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 494 

Qy 2104 GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 2163 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 495 GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 554 

Qy 2164 TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 2223 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 555 TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 614 



Qy 2224 TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 2283

Db 615 674

Qy 2284 TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 2343

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 675 TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 734

Qy 2344 AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 2403

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 735 AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 794

Qy 2404 CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 2463

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 795 CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 854

Qy 2464 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 2523

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 855 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 914

Qy 2524 GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 2583

Db 915 974

Qy 2584 AATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAA 2643

|||||||||||||||||||||||||||||| |||||||||||||||||||||||||||||

Db 975 AATCAGATAGTTCTTTTTCACAAGTTCAAT-CTGTTTTTCATGTAAATTTTGTATGAAAA 1033

Qy 2644 ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 2703

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1034 ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 1093

Qy 2704 GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 2763

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1094 GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 1153

Qy 2764 ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 2823

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1154 ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 1213

Qy 2824 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 2883

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1214 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 1273

Qy 2884 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 2943

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1274 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 1333

Qy 2944 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 3003

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1334 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 1393

Qy 3004 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 3063

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1394 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 1453

Qy 3064 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 3123



||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1454 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 1513

Qy 3124 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 3183

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1514 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 1573

Qy 3184 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 3243

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1574 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 1633

Qy 3244 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 3303

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1634 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 1693

Qy 3304 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 3363

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1694 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 1753

Qy 3364 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 3423

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1754 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 1813

Qy 3424 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 3483

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1814 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 1873

Qy 3484 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 3543

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1874 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 1933

Qy 3544 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 3603

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1934 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 1993

Qy 3604 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 3663

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1994 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 2053

Qy 3664 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 3723

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2054 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 2113

Qy 3724 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 3783

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2114 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 2173

Qy 3784 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 3843

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2174 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 2233

Qy 3844 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 3903

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2234 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 2293

Qy 3904 CAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAA 3963

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2294 CAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAA 2353



Qy 3964 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 4023

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2354 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 2413

Qy 4024 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4083

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2414 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 2473

Qy 4084 AATAAAAGTTTACAGAAACCTT 4105

||||||||||||||||||||||

Db 2474 AATAAAAGTTTACAGAAACCTT 2495



RESULT 9 
ABK35492

ID ABK35492 standard; DNA; 2595 BP. 
XX 

AC ABK35492;
XX 

DT 08-MAY-2002 (first entry) 
XX 

DE Human endometrial cancer related gene, EDNRA.
XX 

KW Human; ds; gene; endometrial cancer; differential expression;

KW DNA microarray; protein microarray. 

XX 

OS Homo sapiens. 
XX 

PN WO200209573-A2.
XX

PD 07-FEB-2002.
XX

PF 31-JUL-2001; 2001WO-US24104.
XX

PR 31-JUL-2000; 2000US-221735P.
XX 

PA (BGHM ) BRIGHAM & WOMENS HOSPITAL INC. 
XX 

PI Mutter GL; 
XX 

DR WPI; 2002-179967/23. 

DR P-PSDB; AAU84272. 
XX 

PT Diagnosing endometrial cancer comprises determining expression of 

PT nucleic acid molecules or expression products that are differentially 

PT expressed in normal and malignant endometrium - 

XX 

PS Claim 1; Page 60-62; 233pp; English.
XX 

CC The invention relates to diagnosing endometrial cancer in a subject 

CC suspected of having endometrial cancer comprising determining the 

CC expression of a set of nucleic acid molecules or expression products in 

CC an endometrial sample suspected of being cancerous, where the set of 

CC nucleic acid molecules comprises at least 2 nucleic acid molecules 

CC selected from 50 fully defined sequences as given in the specification. 

CC The nucleic acids are used as an array of at least 2 of the 50 



CC nucleic acids bound to a solid substrate. Also included is a solid-phase 

CC protein microarray comprising at least 2 antibodies or its antigen 

CC binding fragments, that specifically bind at least 2 different 

CC polypeptides from the 50 fully defined sequences as given in the 

CC specification, fixed to a solid substrate. The methods and arrays are 

CC useful for the diagnosis of endometrial cancer, selecting and monitoring 

CC treatment regimes and identification of lead compounds useful for the 

CC treatment of endometrial cancer. The present sequence is one of 50 

CC genes differentially expressed between cancerous and non-cancerous 

CC samples. 

XX 

SQ Sequence 2595 BP; 788 A; 478 C; 459 G; 870 T; 0 other; 
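The SQ line above carries its own consistency check: the per-base counts should add up to the stated length. A minimal Python sketch of that check follows. The function name `parse_sq_line` is hypothetical (it is not part of any search tool), and the regular expressions assume the SQ line layout exactly as printed in this listing.

```python
import re


def parse_sq_line(line):
    """Parse an SQ summary line into (length, {base: count}).

    Assumes the layout shown in this listing:
    'SQ Sequence <N> BP; <n> A; <n> C; <n> G; <n> T; <n> other;'
    """
    m = re.match(r"SQ\s+Sequence\s+(\d+)\s+BP;(.*)", line.strip())
    if m is None:
        raise ValueError("not an SQ line: %r" % line)
    length = int(m.group(1))
    # Each remaining field is '<count> <name>;'
    counts = {name: int(n) for n, name in re.findall(r"(\d+)\s+(\w+);", m.group(2))}
    return length, counts


length, counts = parse_sq_line(
    "SQ Sequence 2595 BP; 788 A; 478 C; 459 G; 870 T; 0 other;"
)
# The composition is consistent when the counts sum to the length.
print(length, sum(counts.values()), length == sum(counts.values()))
```

The same check can be run on the other SQ lines in this listing to catch digits damaged during text extraction.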

Query Match 60.2%; Score 2470; DB 24; Length 2595; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2481; Conservative 0; Mismatches 0; Indels 1; Gaps 1; 
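The percentages in the summary above appear to follow from the other reported numbers. The sketch below is an assumption about how they are derived, not the search program's documented definition: Query Match is taken as the score divided by the perfect score (4105, from the run header), and Best Local Similarity as matches divided by the number of alignment columns (matches + conservative + mismatches + indels). Both function names are hypothetical.

```python
def query_match_percent(score, perfect_score):
    """Score as a percentage of the query's perfect (self-match) score."""
    return 100.0 * score / perfect_score


def best_local_similarity(matches, conservative, mismatches, indels):
    """Matches as a percentage of all alignment columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Values taken from the summary lines of this result.
print(round(query_match_percent(2470, 4105), 1))
print(round(best_local_similarity(2481, 0, 0, 1), 1))
```

Under these assumptions the rounded values agree with the 60.2% and 100.0% printed for this result, and the same formulas give 44.3% and 99.7% for a record reporting Score 1819.6 with 1844 matches, 4 mismatches and 2 indels.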
Qy 1624 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 1683

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 15 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 74

Qy 1684 GAACGGAACAAGCATCCAGTGGAAGAACCACGATCA^ 1743

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 75 GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACC^ 134

Qy 1744 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 1803

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 135 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 194

Qy 1804 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 1863

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 195 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 254

Qy 1864 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 1923

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 255 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 314

Qy 1924 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 1983

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 315 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 374

Qy 1984 TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 2043

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 375 TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 434

Qy 2044 TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 2103

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 435 TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 494

Qy 2104 GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 2163

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 495 GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 554

Qy 2164 TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 2223

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 555 TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 614



Qy 2224 TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 2283

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 615 TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 674

Qy 2284 TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 2343

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 675 TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 734

Qy 2344 AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 2403

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 735 AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 794

Qy 2404 CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 2463

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 795 CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 854

Qy 2464 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 2523

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 855 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 914

Qy 2524 GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 2583

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 915 GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 974

Qy 2584 AATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAA 2643

|||||||||||||||||||||||||||||| |||||||||||||||||||||||||||||

Db 975 AATCAGATAGTTCTTTTTCACAAGTTCAAT-CTGTTTTTCATGTAAATTTTGTATGAAAA 1033

Qy 2644 ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 2703

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1034 ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 1093

Qy 2704 GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 2763

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1094 GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 1153

Qy 2764 ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 2823

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1154 ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 1213

Qy 2824 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 2883

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1214 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 1273

Qy 2884 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 2943

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1274 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 1333

Qy 2944 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 3003

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1334 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 1393

Qy 3004 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 3063

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1394 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 1453



Qy 3064 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 3123

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1454 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 1513

Qy 3124 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 3183

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1514 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 1573

Qy 3184 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 3243

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1574 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 1633

Qy 3244 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 3303

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1634 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 1693

Qy 3304 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 3363

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1694 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 1753

Qy 3364 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 3423

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1754 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 1813

Qy 3424 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 3483

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1814 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 1873

Qy 3484 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 3543

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1874 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 1933

Qy 3544 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 3603

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1934 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 1993

Qy 3604 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 3663

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1994 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 2053

Qy 3664 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 3723

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2054 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 2113

Qy 3724 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 3783

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2114 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 2173

Qy 3784 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 3843

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2174 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 2233

Qy 3844 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 3903

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2234 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 2293

Qy 3904 CAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAA 3963

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2294 CAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAA 2353

Qy 3964 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 4023

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2354 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 2413

Qy 4024 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4083

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2414 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 2473

Qy 4084 AATAAAAGTTTACAGAAACCTT 4105

||||||||||||||||||||||

Db 2474 AATAAAAGTTTACAGAAACCTT 2495



RESULT 10 
AAF20903 

ID AAF20903 standard; DNA; 1868 BP. 
XX 

AC AAF20903;
XX 

DT 14-MAR-2001 (first entry) 
XX 

DE Human low adenosine antisense oligonucleotide #2470. 
XX 

KW Low adenosine antisense oligonucleotide; phosphorothioate; allergy; 

KW human; airway disorder; bronchoconstriction; lung inflammation;

KW surfactant depletion; respiratory; bronchodilator; antiinflammatory; 

KW immunosuppressive; antiasthmatic; analgesic; hypotensive; cytostatic; 

KW respiratory obstruction; pulmonary obstruction; impeded respiration; 

KW surfactant hypoproduction; pulmonary vasoconstriction; asthma; RDS; 

KW respiratory distress syndrome; pain; cystic fibrosis; allergic rhinitis; 

KW pulmonary hypertension; emphysema; pulmonary transplantation rejection; 

KW chronic obstructive pulmonary disease; pulmonary infection; bronchitis; 

KW cancer; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200062736-A2.
XX

PD 26-OCT-2000.
XX

PF 24-MAR-2000; 2000WO-US08020.
XX

PR 06-APR-1999; 99US-0127958.
XX 

PA (UYEC-) UNIV EAST CAROLINA. 

PA (NYCE/) NYCE J W. 

XX 

PI Nyce JW; 
XX 

DR WPI; 2000-679539/66. 
XX 

PT Low adenosine (A) content antisense oligonucleotides which do not 

PT trigger adenosine receptors during metabolism, useful e.g. for treating 

PT cancers and respiratory obstructions - 

XX 



PS Disclosure; Page 676; 1592pp; English. 
XX 

CC The present invention describes low adenosine (A) content antisense 

CC oligonucleotides and compositions (I) comprising them. In the antisense 

CC oligonucleotides the A is replaced by a 'Universal' or alternative base. 

CC (I) can have respiratory, bronchodilator, antiinflammatory, analgesic,

CC immunosuppressive, antiasthmatic, hypotensive and cytostatic activities. 

CC The antisense oligonucleotides and (I) can be used to down-regulate the 

CC expression and or activity of target polypeptides associated with 

CC lung/respiratory disorders and malignancies, such as stimulating and 

CC activating peptide factors and transmitters, transcription factors, 

CC immunoglobulins and antibodies, antibody receptors, cytokines and 

CC chemokines, endogenously produced specific and non-specific enzymes, 

CC binding proteins, adhesion molecules and their receptors, cytokine and 

CC chemokine receptors, adenosine receptors, bradykinin receptors, central 

CC nervous system (CNS) and peripheral nervous and non-nervous system 

CC receptors, CNS and peripheral nervous and non-nervous system peptide 

CC transmitters, defensins, growth factors, vasoactive peptides and 

CC receptors, binding proteins and malignancy associated proteins. The 

CC antisense oligonucleotides may be used in this way to treat disorders 

CC including respiratory obstruction (especially pulmonary obstruction 

CC and/or bronchoconstriction) and/or lung inflammation, allergy(ies)

CC and/or surfactant hypoproduction which are associated with a disease or 

CC condition selected from pulmonary vasoconstriction, inflammation, 

CC allergies, asthma, impeded respiration, respiratory distress syndrome 

CC (RDS), pain, cystic fibrosis (CF), allergic rhinitis (AR), pulmonary

CC hypertension, emphysema, chronic obstructive pulmonary disease (COPD),

CC pulmonary transplantation rejection, pulmonary infections, bronchitis, 

CC and/or cancer. AAF18434 to AAF21543 represent human polynucleotide 

CC fragments and antisense oligonucleotides used in the exemplification of 

CC the present invention. 

XX 

SQ Sequence 1868 BP; 512 A; 419 C; 389 G; 548 T; 0 other; 

Query Match 44.3%; Score 1819.6; DB 21; Length 1868; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 1844; Conservative 0; Mismatches 4; Indels 2; Gaps 2; 
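The score in the summary above can be cross-checked against the run parameters (Gapop 10.0, Gapext 1.0). The sketch below rests on two assumptions that are not stated in this listing: each match scores +1 under the identity table, and each single-base gap costs Gapop + Gapext. The helper name `unexplained_score` is hypothetical. Whatever is left after subtracting gap costs must be accounted for by mismatches.

```python
from decimal import Decimal


def unexplained_score(score, matches, gaps, gap_open="10.0", gap_ext="1.0"):
    """Return matches*1 - gaps*(open + ext) - score.

    Assumes +1 per match and that every gap has length 1, so each gap
    costs gap_open + gap_ext. Decimal avoids float rounding noise.
    """
    gap_cost = gaps * (Decimal(gap_open) + Decimal(gap_ext))
    return Decimal(matches) - gap_cost - Decimal(score)


# Record with no mismatches (Score 2470, 2481 matches, 1 gap): nothing left over.
print(unexplained_score("2470", 2481, 1))

# This record: 4 mismatches share the remainder.
residual = unexplained_score("1819.6", 1844, 2)
print(residual, residual / 4)
```

Under these assumptions the mismatch-free record leaves a remainder of zero, which supports the gap-cost convention, and this record leaves 2.4, or 0.6 per mismatch. The per-mismatch figure is an inference from the printed numbers, not a documented parameter of the scoring table.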
Qy 421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 10 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 69

Qy 481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 70 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 129

Qy 541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 130 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 189

Qy 601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 190 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 249

Qy 661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 250 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 309



Qy 721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 310 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 369

Qy 781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 370 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 429

Qy 841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 430 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 489

Qy 901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 490 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 549

Qy 961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 550 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 609

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 610 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 669

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 670 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 729

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 730 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 789

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 790 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 849

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 850 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 909

Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 910 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 969

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 970 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1029

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

|||||||||||| ||||||||||||||||||||||||||||||||||| |||||||||||

Db 1030 GTTCCCTCTTCATTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAGATGGACAAGAA 1089

Qy 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1090 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1149



Qy 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1150 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1209

Qy 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1210 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1269

Qy 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACC^ 1740

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1270 CATGAACGGAACAAGCATCCAGTGGAAGAACC^ 1329

Qy 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1330 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1389

Qy 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||

Db 1390 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGAGTCCGGGAATCTCTTCTCTGATC 1449

Qy 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAA-GGTAG 1919

|||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||

Db 1450 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGGTAG 1509

Qy 1920 ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1979

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1510 ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1569

Qy 1980 ATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGT 2039

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1570 ATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGT 1629

Qy 2040 TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 2099

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1630 TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 1689

Qy 2100 GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 2159

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1690 GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 1749

Qy 2160 GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGA-TTTTTTGTCCTTCAGCCAAACA 2218

||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||

Db 1750 GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTTGTCCTTCAGCCAAACA 1809

Qy 2219 CAATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 2268

||||||||||||| ||||||||||||||||||||||||||||||||||||

Db 1810 CAATATGGGCTCAAGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 1859
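The marker line between each Qy and Db row can be regenerated from the two rows themselves, which is useful when the printed markers are damaged. The Python sketch below assumes a simple convention, a bar for an identical base and a space for a mismatch or a gap; the function name `compare_rows` is hypothetical, and the search program may mark some column types differently.

```python
def compare_rows(qy, db):
    """Build a marker line for two aligned rows and count column types.

    Returns (marker_line, matches, mismatches, indels). A '-' in either
    row is counted as an indel; both rows must have the same width.
    """
    if len(qy) != len(db):
        raise ValueError("aligned rows must have equal length")
    marks = []
    matches = mismatches = indels = 0
    for q, d in zip(qy.upper(), db.upper()):
        if q == "-" or d == "-":
            indels += 1
            marks.append(" ")
        elif q == d:
            matches += 1
            marks.append("|")
        else:
            mismatches += 1
            marks.append(" ")
    return "".join(marks), matches, mismatches, indels


# Final row of the alignment above (Qy 2219..2268 against Db 1810..1859).
qy = "CAATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT"
db = "CAATATGGGCTCAAGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT"
line, matches, mismatches, indels = compare_rows(qy, db)
print(qy)
print(line)
print(db)
print(matches, mismatches, indels)
```

Summing the counts over every row of an alignment gives totals that can be compared with the Matches, Mismatches and Indels figures in the record's summary line.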



RESULT 11 
AAF20915 

ID AAF20915 standard; DNA; 1868 BP. 
XX 

AC AAF20915; 
XX 

DT 14-MAR-2001 (first entry) 
XX 



DE Human endothelin receptor A polynucleotide fragment #2482. 
XX 

KW Low adenosine antisense oligonucleotide; phosphorothioate; allergy; 

KW human; airway disorder; bronchoconstriction; lung inflammation; 

KW surfactant depletion; respiratory; bronchodilator; antiinflammatory;

KW immunosuppressive; antiasthmatic; analgesic; hypotensive; cytostatic; 

KW respiratory obstruction; pulmonary obstruction; impeded respiration; 

KW surfactant hypoproduction; pulmonary vasoconstriction; asthma; RDS; 

KW respiratory distress syndrome; pain; cystic fibrosis; allergic rhinitis;

KW pulmonary hypertension; emphysema; pulmonary transplantation rejection; 

KW chronic obstructive pulmonary disease; pulmonary infection; bronchitis; 

KW cancer; ss. 

XX 

OS Homo sapiens.
XX

PN WO200062736-A2.
XX

PD 26-OCT-2000.
XX

PF 24-MAR-2000; 2000WO-US08020.
XX

PR 06-APR-1999; 99US-0127958.
XX

PA (UYEC-) UNIV EAST CAROLINA.

PA (NYCE/) NYCE J W. 

XX 

PI Nyce JW; 
XX 

DR WPI; 2000-679539/66. 
XX 

PT Low adenosine (A) content antisense oligonucleotides which do not 

PT trigger adenosine receptors during metabolism, useful e.g. for treating 

PT cancers and respiratory obstructions - 

XX 

PS Disclosure; Page 244; 1592pp; English. 
XX 

CC The present invention describes low adenosine (A) content antisense 

CC oligonucleotides and compositions (I) comprising them. In the antisense 

CC oligonucleotides the A is replaced by a 'Universal' or alternative base.

CC (I) can have respiratory, bronchodilator, antiinflammatory, analgesic, 

CC immunosuppressive, antiasthmatic, hypotensive and cytostatic activities.

CC The antisense oligonucleotides and (I) can be used to down-regulate the 

CC expression and or activity of target polypeptides associated with 

CC lung/respiratory disorders and malignancies, such as stimulating and 

CC activating peptide factors and transmitters, transcription factors, 

CC immunoglobulins and antibodies, antibody receptors, cytokines and 

CC chemokines, endogenously produced specific and non-specific enzymes, 

CC binding proteins, adhesion molecules and their receptors, cytokine and 

CC chemokine receptors, adenosine receptors, bradykinin receptors, central 

CC nervous system (CNS) and peripheral nervous and non-nervous system 

CC receptors, CNS and peripheral nervous and non-nervous system peptide 

CC transmitters, defensins, growth factors, vasoactive peptides and 

CC receptors, binding proteins and malignancy associated proteins. The 

CC antisense oligonucleotides may be used in this way to treat disorders 

CC including respiratory obstruction (especially pulmonary obstruction 

CC and/or bronchoconstriction) and/or lung inflammation, allergy(ies)

CC and/or surfactant hypoproduction which are associated with a disease or 



CC condition selected from pulmonary vasoconstriction, inflammation, 

CC allergies, asthma, impeded respiration, respiratory distress syndrome 

CC (RDS), pain, cystic fibrosis (CF), allergic rhinitis (AR), pulmonary

CC hypertension, emphysema, chronic obstructive pulmonary disease (COPD),

CC pulmonary transplantation rejection, pulmonary infections, bronchitis, 

CC and/or cancer. AAF18434 to AAF21543 represent human polynucleotide 

CC fragments and antisense oligonucleotides used in the exemplification of 

CC the present invention. 

XX 

SQ Sequence 1868 BP; 512 A; 419 C; 389 G; 548 T; 0 other; 

Query Match 44.3%; Score 1819.6; DB 21; Length 1868; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 1844; Conservative 0; Mismatches 4; Indels 2; Gaps 2; 

Qy 421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 10 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 69

Qy 481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 70 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 129

Qy 541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 130 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 189

Qy 601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 190 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 249

Qy 661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 250 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 309

Qy 721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 310 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 369

Qy 781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 370 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 429

Qy 841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 430 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 489

Qy 901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 490 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 549

Qy 961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 550 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 609

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



Db 610 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 669 

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  670 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 729

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  730 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 789

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  790 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 849

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  850 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 909

Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  910 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 969

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  970 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1029

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

        |||||||||||| ||||||||||||||||||||||||||||||||||| |||||||||||

Db 1030 GTTCCCTCTTCATTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAGATGGACAAGAA 1089

Qy 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1090 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1149

Qy 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1150 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1209

Qy 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1210 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1269

Qy 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1270 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1329

Qy 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1330 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1389

Qy 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||

Db 1390 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGAGTCCGGGAATCTCTTCTCTGATC 1449

Qy 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAA-GGTAG 1919

        |||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||

Db 1450 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGGTAG 1509



Qy 1920 ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1979

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1510 ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1569

Qy 1980 ATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGT 2039

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1570 ATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGT 1629

Qy 2040 TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 2099

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1630 TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 1689

Qy 2100 GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 2159

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1690 GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 1749

Qy 2160 GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGA-TTTTTTGTCCTTCAGCCAAACA 2218

        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||

Db 1750 GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTTGTCCTTCAGCCAAACA 1809

Qy 2219 CAATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 2268

        ||||||||||||| ||||||||||||||||||||||||||||||||||||

Db 1810 CAATATGGGCTCAAGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 1859



RESULT 12 
AAA34781 

ID AAA34781 standard; DNA; 1868 BP. 
XX 

AC AAA34781; 
XX 

DT 28-JUL-2000 (first entry) 
XX 

DE Human adenosine receptor related polynucleotide SEQ ID NO: 2470. 
XX 

KW Human; adenosine receptor; low adenosine antisense oligonucleotide; 

KW phosphorothioate; impaired respiration; inflammation; allergy; 

KW allergic disease; bronchoconstriction; inhibitor; antiinflammatory; 

KW antiallergic; antiasthmatic; cytostatic; analgesic; impaired airway; 

KW lung disease; ischaemic condition; pulmonary vasoconstriction; asthma; 

KW respiratory distress syndrome; pain; cystic fibrosis; emphysema; 

KW pulmonary hypertension; chronic obstructive pulmonary disease; COPD; 

KW cancer; leukaemia; lymphoma; carcinoma; metastasis; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200009525-A2.
XX

PD 24-FEB-2000.
XX

PF 03-AUG-1999; 99WO-US17712.
XX

PR 03-AUG-1998; 98US-0095212.
XX 

PA (UYEC-) UNIV EAST CAROLINA. 
XX 



PI Nyce JW; 
XX 

DR WPI; 2000-205971/18. 
XX 

PT New antisense oligonucleotides useful for treating e.g. pulmonary 

PT vasoconstriction, inflammation, allergies, asthma, hypertension, 

PT bronchitis, emphysema, respiratory distress syndrome, ischemia or 

PT cancers 
XX 

PS Disclosure; Page 606-607; 1343pp; English. 
XX 

CC The present invention describes a new composition comprising an 

CC antisense oligonucleotide (ON) with low adenosine (up to 15%), which

CC targets nucleic acids involved in bronchoconstriction, allergies, and/or

CC inflammation. The ON can have antiinflammatory, antiallergic, 

CC antiasthmatic, cytostatic and analgesic activities. The compositions are 

CC useful for the treatment of diseases associated with inflammation, 

CC impaired airways, including lung disease and diseases whose secondary 

CC effects afflict the lungs of a subject. They can be used for treating 

CC e.g. ischaemic conditions, pulmonary vasoconstriction, allergies, 

CC asthma, impeded respiration, respiratory distress syndrome, pain, cystic 

CC fibrosis, pulmonary hypertension, emphysema, chronic obstructive 

CC pulmonary disease (COPD), and cancers such as leukaemias, lymphomas,

CC carcinomas, and cancers which may metastasise to the lungs, including 

CC breast and prostate cancer. The reduction of the adenosine content of 

CC the ONs reduces side effects. The A-containing ONs break down with the 

CC release of deoxyadenosine which activates adenosine receptors causing 

CC bronchoconstriction and inflammation. AAA32313 to AAA35312 represent the 

CC nucleotide sequences given in the sequence listing from the present 

CC invention, which correspond to SEQ ID NO:1 to 2815, and then the last

CC 185 sequences are also called SEQ ID NO:1 to 185, but the sequences

CC differ from the previously named sequences. SEQ ID NO: 11 to 1680 

CC (AAA32323 to AAA33992) are specifically claimed ONs from the present 

CC invention. N.B. Sequences given in the disclosure of the present 

CC invention do not match up with their corresponding SEQ ID NO: sequences 

CC given in the sequence listing. 

XX 

SQ Sequence 1868 BP; 512 A; 419 C; 389 G; 548 T; 0 other; 



Query Match 44.3%; Score 1819.6; DB 21; Length 1868; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 1844; Conservative 0; Mismatches 4; Indels 2; Gaps 2; 



Qy  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db   10 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 69


Qy  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db   70 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 129


Qy  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  130 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 189


Qy  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  190 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 249



Qy  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  250 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 309

Qy  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  310 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 369

Qy  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  370 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 429

Qy  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  430 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 489

Qy  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  490 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 549

Qy  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  550 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 609

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  610 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 669

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  670 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 729

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  730 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 789

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  790 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 849

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  850 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 909

Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  910 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 969

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  970 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1029

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

        |||||||||||| ||||||||||||||||||||||||||||||||||| |||||||||||

Db 1030 GTTCCCTCTTCATTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAGATGGACAAGAA 1089



Qy 


1501 


Db 


1090 


Qy 


1561 


Db 


1150 


Qy 


1621 


Db 


1210 


Qy 


1681 


Db 


1270 


Qy 


1741 


Db 


1330 


Qy 


1801 


Db 


1390 


Qy 


1861 


Db 


1450 


Qy 


1920 


Db 


1510 


Qy 


1980 


Db 


1570 


Qy 


2040 


Db 


1630 


Qy 


2100 


Db 


1690 


Qy 


2160 


Db 


1750 


Qy 


2219 


Db 


1810 



CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1149
CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1209
TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1269



'I I! I II M I II :i II I II '|| | || MM 



GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1389 



II M I M II I II II II I Ml M M IMM 



CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAA-GGTAG 1919

I M IMM IMil Ml IIMH M MM II: I Ml Mill 

CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGC^ 1509 

ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1979 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1569 



II I II Ml I M II II I II 'M M M IM 



TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 2099

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 1689
GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 2159

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 1749
GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGA-TTTTTTGTCCTTCAGCCAAACA 2218

||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||

GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTTGTCCTTCAGCCAAACA 1809

CAATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 2268

||||||||||||| ||||||||||||||||||||||||||||||||||||
CAATATGGGCTCAAGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 1859



RESULT 13 
AAA34793 

ID AAA34793 standard; DNA; 1868 BP. 



XX 

AC AAA34793; 
XX 

DT 28-JUL-2000 (first entry) 
XX 

DE Human adenosine receptor related polynucleotide SEQ ID NO: 2482. 
XX 

KW Human; adenosine receptor; low adenosine antisense oligonucleotide; 

KW phosphorothioate; impaired respiration; inflammation; allergy; 

KW allergic disease; bronchoconstriction; inhibitor; antiinflammatory;

KW antiallergic; antiasthmatic; cytostatic; analgesic; impaired airway; 

KW lung disease; ischaemic condition; pulmonary vasoconstriction; asthma; 

KW respiratory distress syndrome; pain; cystic fibrosis; emphysema; 

KW pulmonary hypertension; chronic obstructive pulmonary disease; COPD; 

KW cancer; leukaemia; lymphoma; carcinoma; metastasis; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200009525-A2. 
XX 

PD 24-FEB-2000. 
XX 

PF 03-AUG-1999; 99WO-US17712.
XX

PR 03-AUG-1998; 98US-0095212.
XX

PA (UYEC-) UNIV EAST CAROLINA.
XX 

PI Nyce JW; 
XX 

DR WPI; 2000-205971/18. 
XX 

PT New antisense oligonucleotides useful for treating e.g. pulmonary 

PT vasoconstriction, inflammation, allergies, asthma, hypertension, 

PT bronchitis, emphysema, respiratory distress syndrome, ischemia or 

PT cancers 
XX 

PS Disclosure; Page 645-646; 1343pp; English.
XX 

CC The present invention describes a new composition comprising an 

CC antisense oligonucleotide (ON) with low adenosine (up to 15%), which

CC targets nucleic acids involved in bronchoconstriction, allergies, and/or

CC inflammation. The ON can have antiinflammatory, antiallergic, 

CC antiasthmatic, cytostatic and analgesic activities. The compositions are 

CC useful for the treatment of diseases associated with inflammation, 

CC impaired airways, including lung disease and diseases whose secondary 

CC effects afflict the lungs of a subject. They can be used for treating 

CC e.g. ischaemic conditions, pulmonary vasoconstriction, allergies, 

CC asthma, impeded respiration, respiratory distress syndrome, pain, cystic 

CC fibrosis, pulmonary hypertension, emphysema, chronic obstructive 

CC pulmonary disease (COPD), and cancers such as leukaemias, lymphomas, 

CC carcinomas, and cancers which may metastasise to the lungs, including 

CC breast and prostate cancer. The reduction of the adenosine content of 

CC the ONs reduces side effects. The A-containing ONs break down with the 

CC release of deoxyadenosine which activates adenosine receptors causing 

CC bronchoconstriction and inflammation. AAA32313 to AAA35312 represent the

CC nucleotide sequences given in the sequence listing from the present 



CC invention, which correspond to SEQ ID NO:1 to 2815, and then the last

CC 185 sequences are also called SEQ ID NO:1 to 185, but the sequences

CC differ from the previously named sequences. SEQ ID NO: 11 to 1680 

CC (AAA32323 to AAA33992) are specifically claimed ONs from the present 

CC invention. N.B. Sequences given in the disclosure of the present 

CC invention do not match up with their corresponding SEQ ID NO: sequences 

CC given in the sequence listing. 

XX 

SQ Sequence 1868 BP; 512 A; 419 C; 389 G; 548 T; 0 other; 

Query Match 44.3%; Score 1819.6; DB 21; Length 1868; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 1844; Conservative 0; Mismatches 4; Indels 2; Gaps 2; 
Qy  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db   10 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 69

Qy  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db   70 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 129

Qy  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  130 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 189

Qy  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  190 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 249

Qy  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  250 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 309

Qy  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  310 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 369

Qy  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  370 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 429

Qy  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  430 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 489

Qy  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  490 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 549

Qy  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  550 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 609

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  610 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 669



Qy 


1081 


Db 


670 


Qy 


1141 


Db 


730 


Qy 


1201 


Db 


790 


Qy 


1261 


Db 


850 


Qy 


1321 


Db 


910 


Qy 


1381 


Db 


970 


Qy 


1441 


Db 


1030 


Qy 


1501 


Db 


1090 


Qy 


1561 


Db 


1150 


Qy 


1621 


Db 


1210 


Qy 


1681 


Db 


1270 


Qy 


1741 


Db 


1330 


Qy 


1801 


Db 


1390 


Qy 


1861 


Db 


1450 



TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 729
TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 789
TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 849
CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 909
TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 969
GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1029
GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

|||||||||||| ||||||||||||||||||||||||||||||||||| |||||||||||

GTTCCCTCTTCATTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAGATGGACAAGAA 1089
CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1149
CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1209
TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1269
CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1329



||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||

AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGAGTCCGGGAATCTCTTCTCTGATC 1449
CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAA-GGTAG 1919

|||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||

CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGGTAG 1509



Qy 1920 ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1979

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1510 ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1569

Qy 1980 ATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGT 2039

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1570 ATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGT 1629

Qy 2040 TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 2099

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1630 TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 1689

Qy 2100 GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 2159

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1690 GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 1749

Qy 2160 GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGA-TTTTTTGTCCTTCAGCCAAACA 2218

        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||

Db 1750 GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTTGTCCTTCAGCCAAACA 1809

Qy 2219 CAATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 2268

        ||||||||||||| ||||||||||||||||||||||||||||||||||||

Db 1810 CAATATGGGCTCAAGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 1859



RESULT 14 
AAF20904 

ID AAF20904 standard; DNA; 2008 BP. 
XX 

AC AAF20904;
XX 

DT 14-MAR-2001 (first entry) 
XX 

DE Human low adenosine antisense oligonucleotide #2471. 
XX 

KW Low adenosine antisense oligonucleotide; phosphorothioate; allergy; 

KW human; airway disorder; bronchoconstriction; lung inflammation;

KW surfactant depletion; respiratory; bronchodilator; antiinflammatory;

KW immunosuppressive; antiasthmatic; analgesic; hypotensive; cytostatic; 

KW respiratory obstruction; pulmonary obstruction; impeded respiration; 

KW surfactant hypoproduction; pulmonary vasoconstriction; asthma; RDS;

KW respiratory distress syndrome; pain; cystic fibrosis; allergic rhinitis; 

KW pulmonary hypertension; emphysema; pulmonary transplantation rejection; 

KW chronic obstructive pulmonary disease; pulmonary infection; bronchitis; 

KW cancer; ss. 

XX 

OS Homo sapiens.
XX

PN WO200062736-A2.
XX

PD 26-OCT-2000.
XX

PF 24-MAR-2000; 2000WO-US08020.
XX

PR 06-APR-1999; 99US-0127958.
XX 

PA (UYEC-) UNIV EAST CAROLINA. 



PA (NYCE/) NYCE J W. 
XX 

PI Nyce JW; 
XX 

DR WPI; 2000-679539/66. 
XX 

PT Low adenosine (A) content antisense oligonucleotides which do not 

PT trigger adenosine receptors during metabolism, useful e.g. for treating 

PT cancers and respiratory obstructions - 

XX 

PS Disclosure; Page 676-677; 1592pp; English.
XX 

CC The present invention describes low adenosine (A) content antisense 

CC oligonucleotides and compositions (I) comprising them. In the antisense 

CC oligonucleotides the A is replaced by a 'Universal' or alternative base. 

CC (I) can have respiratory, bronchodilator, antiinflammatory, analgesic, 

CC immunosuppressive, antiasthmatic, hypotensive and cytostatic activities. 

CC The antisense oligonucleotides and (I) can be used to down-regulate the 

CC expression and or activity of target polypeptides associated with 

CC lung/respiratory disorders and malignancies, such as stimulating and 

CC activating peptide factors and transmitters, transcription factors, 

CC immunoglobulins and antibodies, antibody receptors, cytokines and 

CC chemokines, endogenously produced specific and non-specific enzymes, 

CC binding proteins, adhesion molecules and their receptors, cytokine and 

CC chemokine receptors, adenosine receptors, bradykinin receptors, central 

CC nervous system (CNS) and peripheral nervous and non-nervous system 

CC receptors, CNS and peripheral nervous and non-nervous system peptide 

CC transmitters, defensins, growth factors, vasoactive peptides and 

CC receptors, binding proteins and malignancy associated proteins. The 

CC antisense oligonucleotides may be used in this way to treat disorders 

CC including respiratory obstruction (especially pulmonary obstruction 

CC and/or bronchoconstriction) and/or lung inflammation, allergy(ies)

CC and/or surfactant hypoproduction which are associated with a disease or 

CC condition selected from pulmonary vasoconstriction, inflammation, 

CC allergies, asthma, impeded respiration, respiratory distress syndrome 

CC (RDS), pain, cystic fibrosis (CF), allergic rhinitis (AR), pulmonary

CC hypertension, emphysema, chronic obstructive pulmonary disease (COPD),

CC pulmonary transplantation rejection, pulmonary infections, bronchitis, 

CC and/or cancer. AAF18434 to AAF21543 represent human polynucleotide 

CC fragments and antisense oligonucleotides used in the exemplification of 

CC the present invention. 

XX 

SQ Sequence 2008 BP; 563 A; 463 C; 416 G; 566 T; 0 other; 

Query Match 44.3%; Score 1819.6; DB 21; Length 2008; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 1844; Conservative 0; Mismatches 4; Indels 2; Gaps 2; 
Qy 421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480



Db 




Qy 



481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540



Db 




Qy 541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600



Db 


270 


Qy 


601 


Db 


330 


Qy 


661 


Db 


390 


Qy 


721 


Db 


450 


Qy 


781 


Db 


510 


Qy 


841 


Db 


570 


Qy 


901 


Db 


630 


Qy 


961 


Db 


690 


Qy 


1021 


Db 


750 


Qy 


1081 


Db 


810 


Qy 


1141 


Db 


870 


Qy 


1201 


Db 


930 


Qy 


1261 


Db 


990 


Qy 


1321 


Db 


1050 


Qy 


1381 



||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 329



MIMM i I I M II II II M I I I II II MMM I 



I IMIM MM M I II II II II II II MIMI II M I M 



CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 509



I IMM I II II II I M M II II II II M II I 



AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 629 



IIIIMIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMI 



GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 749
TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 809
TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 869



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMMIIIIIIIIIIII 



TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 989



I II M 1 1 1 1 1 III I I M I II I II M 1 1 III Ml I Ml 1 1 II 1 1 II IMIM III 1 1 1 I 

CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 
TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA
GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



Db 



1110 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1169 



Qy 


1441 


Db 


1170 


Qy 


1501 


Db 


1230 


Qy 


1561 


Db 


1290 


Qy 


1621 


Db 


1350 


Qy 


1681 


Db 


1410 


Qy 


1741 


Db 


1470 


Qy 


1801 


Db 


1530 


Qy 


1861 


Db 


1590 


Qy 


1920 


Db 


1650 


Qy 


1980 


Db 


1710 


Qy 


2040 


Db 


1770 


Qy 


2100 


Db


1830


Qy 


2160 


Db 


1890 


Qy 


2219 


Db 


1950 



ii ii; mi 'L ii ii i ii in i it i nun 



CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1289
CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1349



IIIIIIIIIMIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1469



IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 

IMMIIIIIIIMIIIIIIIIIIIIIIIIIMIIII llllllllllllllllllllll 

AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGAGTCCGGGAATCTCTTCTCTGATC 1589 
CTTCTTCCTTAATTCACTCC(^CACCCAAGAAGAAATGCTTTCCAAAACCGC^\-GGTAG 1919 

IMIIIIIIMIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII Mill 

CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGGTAG 1649
ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1979 

IMIMll II M I II Mllllll M II IM I IMIII Ml 

ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1709 



iiimiimiiiiiiimiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiiiiimi 



TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 2099

MIIIIMIIIIIIIIIIIMIIIIIIIIIIMIIIIIMIIIIIMIIIIIIIIIIIII 

TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 1829 
GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 2159 

IIIMIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 

GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 1889 
GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGA-TTTTTTGTCCTTCAGCCAAACA 2218 

I I MMI h M. M | || II 1 llllllllllllllllllllll 

GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTTGTCCTTCAGCCAAACA 1949
CAATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 2268 

Illllllllllll IIMIIMMIMMIIIIIIIIIIIIIIMIIMI 

CAATATGGGCTCAAGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 1999 



RESULT 15 
AAA34782 

ID AAA34782 standard; DNA; 2008 BP. 
XX 

AC AAA34782; 
XX 

DT 28-JUL-2000 (first entry) 
XX 

DE Human adenosine receptor related polynucleotide SEQ ID NO: 2471. 
XX 

KW Human; adenosine receptor; low adenosine antisense oligonucleotide; 

KW phosphorothioate; impaired respiration; inflammation; allergy; 

KW allergic disease; bronchoconstriction; inhibitor; antiinflammatory;

KW antiallergic; antiasthmatic; cytostatic; analgesic; impaired airway; 

KW lung disease; ischaemic condition; pulmonary vasoconstriction; asthma; 

KW respiratory distress syndrome; pain; cystic fibrosis; emphysema; 

KW pulmonary hypertension; chronic obstructive pulmonary disease; COPD; 

KW cancer; leukaemia; lymphoma; carcinoma; metastasis; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200009525-A2. 
XX 

PD 24-FEB-2000. 
XX 

PF 03-AUG-1999; 99WO-US17712.
XX

PR 03-AUG-1998; 98US-0095212.
XX 

PA (UYEC-) UNIV EAST CAROLINA. 
XX 

PI Nyce JW; 
XX 

DR WPI; 2000-205971/18. 
XX 

PT New antisense oligonucleotides useful for treating e.g. pulmonary 

PT vasoconstriction, inflammation, allergies, asthma, hypertension, 

PT bronchitis, emphysema, respiratory distress syndrome, ischemia or 

PT cancers 
XX 

PS Disclosure; Page 607; 1343pp; English.
XX 

CC The present invention describes a new composition comprising an 

CC antisense oligonucleotide (ON) with low adenosine (up to 15%), which

CC targets nucleic acids involved in bronchoconstriction, allergies, and/or 

CC inflammation. The ON can have antiinflammatory, antiallergic, 

CC antiasthmatic, cytostatic and analgesic activities. The compositions are 

CC useful for the treatment of diseases associated with inflammation, 

CC impaired airways, including lung disease and diseases whose secondary 

CC effects afflict the lungs of a subject. They can be used for treating 

CC e.g. ischaemic conditions, pulmonary vasoconstriction, allergies, 

CC asthma, impeded respiration, respiratory distress syndrome, pain, cystic 

CC fibrosis, pulmonary hypertension, emphysema, chronic obstructive 

CC pulmonary disease (COPD), and cancers such as leukaemias, lymphomas, 

CC carcinomas, and cancers which may metastasise to the lungs, including 



CC breast and prostate cancer. The reduction of the adenosine content of 

CC the ONs reduces side effects. The A-containing ONs break down with the 

CC release of deoxyadenosine which activates adenosine receptors causing 

CC bronchoconstriction and inflammation. AAA32313 to AAA35312 represent the 

CC nucleotide sequences given in the sequence listing from the present 

CC invention, which correspond to SEQ ID NO:1 to 2815, and then the last

CC 185 sequences are also called SEQ ID NO:1 to 185, but the sequences

CC differ from the previously named sequences. SEQ ID NO: 11 to 1680 

CC (AAA32323 to AAA33992) are specifically claimed ONs from the present 

CC invention. N.B. Sequences given in the disclosure of the present 

CC invention do not match up with their corresponding SEQ ID NO: sequences 

CC given in the sequence listing. 

XX 

SQ Sequence 2008 BP; 563 A; 463 C; 416 G; 566 T; 0 other; 

Query Match 44.3%; Score 1819.6; DB 21; Length 2008; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 1844; Conservative 0; Mismatches 4; Indels 2; Gaps 2; 
Qy  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  150 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 209

Qy  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  210 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 269

Qy  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  270 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 329

Qy  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  330 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 389

Qy  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  390 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 449

Qy  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  450 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 509

Qy  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  510 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 569

Qy  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  570 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 629

Qy  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  630 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 689

Qy  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



Db  690 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 749

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  750 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 809

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  810 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 869

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  870 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 929

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  930 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 989

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  990 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1049

Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1050 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1109

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1110 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1169

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500
        |||||||||||| ||||||||||||||||||||||||||||||||||| |||||||||||
Db 1170 GTTCCCTCTTCATTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAGATGGACAAGAA 1229

Qy 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1230 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1289

Qy 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1290 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1349

Qy 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1350 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1409

Qy 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1410 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1469

Qy 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1470 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1529

Qy 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860
        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db 1530 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGAGTCCGGGAATCTCTTCTCTGATC 1589



Qy 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAA-GGTAG 1919
        |||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||
Db 1590 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGGTAG 1649

Qy 1920 ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1979
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1650 ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1709

Qy 1980 ATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGT 2039
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1710 ATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGT 1769

Qy 2040 TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 2099
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1770 TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 1829

Qy 2100 GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 2159
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1830 GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 1889

Qy 2160 GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGA-TTTTTTGTCCTTCAGCCAAACA 2218
        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db 1890 GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTTGTCCTTCAGCCAAACA 1949

Qy 2219 CAATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 2268
        ||||||||||||| ||||||||||||||||||||||||||||||||||||
Db 1950 CAATATGGGCTCAAGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 1999
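
The summary percentages printed for RESULT 15 appear to be simple ratios of the other reported numbers. The sketch below recomputes them in Python. The two formulas are an assumption inferred from the values in this listing (Score 1819.6, Perfect score 4105, Matches 1844, Mismatches 4, Indels 2), not a documented GenCore definition, and the function names are hypothetical.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Assumed definition: score as a percentage of the query's self-match score."""
    return 100.0 * score / perfect_score


def best_local_similarity_percent(matches: int, mismatches: int,
                                  indels: int, conservative: int = 0) -> float:
    """Assumed definition: identical columns over all aligned columns.

    Each indel is counted as one alignment column, which fits RESULT 15
    (1848 query residues plus 2 single-base gaps = 1850 columns).
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Values copied from the RESULT 15 summary lines.
qm = query_match_percent(1819.6, 4105)
bls = best_local_similarity_percent(matches=1844, mismatches=4, indels=2)
print(f"Query Match {qm:.1f}%; Best Local Similarity {bls:.1f}%")
```

Under these assumptions the recomputed values agree with the 44.3% and 99.7% printed in the listing.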

Search completed: December 12, 2003, 11:27:08 
Job time : 960.825 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         December 12, 2003, 09:16:57 ; Search time 14402.7 Seconds
                (without alignments)
                11659.930 Million cell updates/sec

Title:          US-09-931-157-1
Perfect score:  4105
Sequence:       1 gaattcgcggccgcctcttg taaaagtttacagaaacctt 4105

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       2888711 seqs, 20454813386 residues

Total number of hits satisfying chosen parameters:      5777422

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
                10:  gb_ro:*
                11:  gb_sts:*
                12:  gb_sy:*
                13:  gb_un:*
                14:  gb_vi:*
                15:  em_ba:*
                16:  em_fun:*
                17:  em_hum:*
                18:  em_in:*
                19:  em_mu:*
                20:  em_om:*
                21:  em_or:*
                22:  em_ov:*
                23:  em_pat:*
                24:  em_ph:*
                25:  em_pl:*
                26:  em_ro:*
                27:  em_sts:*
                28:  em_un:*
                29:  em_vi:*
                30:
                31:  em_htg_inv:*
                32:  em_htg_other:*
                33:  em_htg_mus:*
                34:  em_htg_pln:*
                35:  em_htg_rod:*
                36:  em_htg_mam:*
                37:  em_htg_vrt:*
                38:  em_sy:*
                39:  em_htgo_hum:*
                40:  em_htgo_mus:*
                41:  em_htgo_other:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
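
The throughput figure in the run header can be cross-checked from the other header values. The Python sketch below assumes that one "cell update" is one query residue compared against one database residue, and that each database sequence is searched on both strands. The strand factor is an inference from the hit count (5777422 is exactly twice the 2888711 sequences searched), not something the listing states, and the function name is hypothetical.

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int,
                                 seconds: float, strands: int = 2) -> float:
    """Dynamic-programming cells filled per second, in millions.

    strands=2 is an assumption: both orientations of each database
    sequence are taken to be aligned against the query.
    """
    cells = query_len * db_residues * strands
    return cells / seconds / 1e6


# Values copied from the run header of this search.
rate = million_cell_updates_per_sec(
    query_len=4105,
    db_residues=20454813386,
    seconds=14402.7,
)
print(f"{rate:.1f} million cell updates/sec")
```

With these inputs the result lands within rounding of the 11659.930 Million cell updates/sec printed in the header; the small difference is expected because the search time is reported to only one decimal place.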
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26 
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27 
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28 


951 


23. 
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9 


S81542 
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29 
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2944 
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AF040634 
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2988 


5 


AF472618 
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31 
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16. 


9 


1032 


9 


S81545 


S81545 endothelin- 


32 
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15. 


5 
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BC044316 
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33 


632.8 


15. 


4 


1650 


5 


XLU06633 


U06633 Xenopus lae 
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15 . 


2 
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1 0 


■rt.IT \J O Z) O J 
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9 
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9 
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T) 1 1 1 A ^ Mnmn oarii on 
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Q 
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-tt.v_ X X U U O J 
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IIUUDj rluilHJ S a.p 1 


39 


r± j j . u 






? 1 S7 
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41 


423 .4 
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.3 
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AF472617 
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42 


411.6 
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. 0 
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5 


AB045356 


AB045356 Oryzias 1 


43 
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4 


AF276427 
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44 
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9. 
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RESULT 1 
AR177879 
LOCUS       AR177879     4105 bp    DNA    linear   PAT 17-DEC-2001
DEFINITION  Sequence 1 from patent US 6313276.
ACCESSION   AR177879
VERSION     AR177879.1  GI:17920234
KEYWORDS
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1  (bases 1 to 4105)
  AUTHORS   Imura,H., Nakao,K. and Nakanishi,S.
  TITLE     Human endothelin receptor
  JOURNAL   Patent: US 6313276-A 1 06-NOV-2001;
FEATURES             Location/Qualifiers
     source          1..4105
                     /organism="unknown"
BASE COUNT     1138 a    859 c    845 g   1263 t
ORIGIN



Query Match 100.0%; Score 4105; DB 6; Length 4105;

Best Local Similarity 100.0%; Pred. No. 0;

Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Qy   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

Qy  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180

Qy  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240



Qy  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

Qy  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

Qy  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420

Qy  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

Qy  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
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1261 
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1621 
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Db 
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1 1 II M 1 1 1 1 I M 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 II I M 1 1 1 1 1 ! 1 1 1 II Mill I II 

TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 114 0 
TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 12 0 0 

IIIMIIIillllMIIIIIIIMIIIIIMIIIIIIIIIMMIMIIIIIIMIIIII 

TGAAG CGATTGG CTT CGTCATGGTACCCTTTGAATATAGGGGTGAACAG CATAAAAC CTG 1200 
TATGCTCAATGCCACAT(^^ATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260 

MMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMMIIIIIIMMIIMII 

TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260 
CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 132 0 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIMIIIIIIIMIIIIIIIIIIIIIII 

CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 132 0 
TTGTGAGATGTTGAACAGAAGGAATGGCAG CTTGAGAATTG CC CTCAGTGAACATCTTAA 1 3 80 

IIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIII 

TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380 
GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 144 0 

IIIMMMMIMMMMIMMMMIMIMIIMMMMMMMMMIMM 

G CAG CGTCGAGAAGTGG CAAAAACAGTTTTCTG CTTGGTTGTAATTTTTGCT CTTTG CTG 144 0 
GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 15 0 0 

IMIIIIMIIIIMIIMIMIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIII 

GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 15 00 

CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 

I I I I I I I I I I I I M I I II I I I I I I I III I I I I I II I I I : I I I I I I I I I lllllll I I i 
CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 

CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 162 0 

1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 r 1 1 e i e 1 1 j i r 1 1 1 1 r j i [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 162 0 
TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680 

IMIIIIIIIIIIMIIIMIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMM 

TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 168 0 
CATGAACGGAACAAGC^TCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 174 0 

I I Mill II II II I II II II II MM II II II MM II il Ml II Mil II II MINI I 

C^TGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACC^C^CACAGACCG 174 0 

GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800 

I I II I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I i I I II I I I I I I II I I II I I I I I I I 
GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800 

AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 

AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 
CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 192 0 

III II II II II II I II 11 III I II II MM II II MM 1 1 III II II 1 1 II II lllllll 

CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 192 0 
CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980 

III II II MM II II I II MM II MM II MM II MM I II II II 1 1 II II Ml I I 
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1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 198 0 
TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2 04 0 

IIIIIIIIIMIMMMIIIIIMIIIIIIIIIIIIMIIIIIIIIIMIIMIIIIM 

TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2 04 0 
AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100 

IMIMIMIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIII! 

AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100 
CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 

IIIIIIIIIIIIIIIIIIIIIIIMMMIIIMIIIIMMIIMMIIMIIIIIMI 

CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 
AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 222 0 

MllllillllllMIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 

AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 222 0 
ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 22 8 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280 
TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 

1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 II II I M 1 1 1 II 1 1 1 1 II II 1 1 1 1 1 1 II II 

TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 



2400 



MIIMIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 



INI II II 1 1 III 1 1 II Ml II MM II III II II Mi II II II I MINI II I II I 



TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 252 0 

Ml li Mi: I! 'Ill II.: (I il || || || !|| 

TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 252 0 

ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2 580 

MIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIMIIIIMIIMIIIIIIIIMI 

ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 258 0 
TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2 64 0 

IIMMIMMMMMMMMMMMIMIMIMMMMMIMMM.lMlIM 

TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 264 0 
AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700 

IMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIMIMIIIMIIIIIIIM 

AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 270 0 
TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2 760 

IIIMIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIM 

TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760 
ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 282 0 

IIIIMIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIII 

ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2 82 0 
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Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 
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Db 


3601 



CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 288 0 

IIIIIMIIIIIIIMIIMIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMI 

CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 28 80 
TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2 94 0 

II II Mill II II llllllll I llllllll MM I llllllll MM I llllllll MM 

TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940 
GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000 

MMMIMMMMIIMIMMMMIMMMMMMMMMMMMMMIII 

GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3 000 
TCA.GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3 060 

IIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMMIIIIIIIIMMIIIII 

TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3 060 
GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120 

1 1 1 1 i 1 1 1 II 1 1 II i I M 1 1 1 II I II 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I M 1 1 1 

GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120 
AAAATCTT.TTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 318 0 

MMMMIMMMIIIIMMMMMIMMIMIIMIIMMIMIIIMIMM 

AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 318 0 
ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 

Mill MM IMMMI MINIMI MM MM 1 1 III Mill MM MM MINIM 

ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 
GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 33 00 

1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 II 1 1 II 1 1 1 II 1 1 1 1 1 M 1 1 1 1 II I II 1 1 1 1 1 1 1 II 1 1 II II I 

GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300 
GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 

I III Mill MM MM Mill I III MM II III I III MM Mill IMMMI I III 

GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 
TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 342 0 
AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 348 0 

I I : I I ' I I . I ! I I : I I ' < II' I I I , : i I : I I ' I I 

AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 34 8 0 
TGGTTTGATT^AAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 

I I llllllll I MM I Mill II 1 1 II II III II II II II 1 1 II I II II 1 1 II II IN M 

TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540 
CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 

I M 1 1 II 1 1 M 1 1 II II I M I II 1 1 II 1 1 II II I II 1 1 1 II 1 1 II I II II I II 1 1 1 II 1 1 

CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 
TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 

1 1 ! 1 1 i 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 



Qy 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 372 0 

Qy 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 378 0 

II I! I! Ml IMMM Ii II II IM M M II M! I 

Db 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780 

Qy 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 384 0 

ii in mill ii ii ii M ii ii in ii 'MMi; 

Db 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 384 0 

Qy 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3 900 

I MM II II IM IM II II M II II: II IMMI Ml 

Db 3 841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3 900 

Qy 3 901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3 960 

IIIIIIMIIMIIIIIIIIIIMMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMI 

Db 3 901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960 

Qy 3 961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4 02 0 

IIMIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIMIMIIIIIIIIIIII 

Db 3 961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTtCAACA 4 02 0 

Qy 4 021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4 08 0 

IIIMIIMIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIMIIMIMIMIIM 

Db 4 021 AGTAAGTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4 08 0 

Qy 4 081 TAAAATAAAAGTTTACAGAAACCTT 4105 

II MM Ml I II II MM Mllill 

Db 4 081 TAAAATAAAAGTTTACAGAAACCTT 4105 
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AX548830 4105 bp DNA linear PAT 26-NOV-2002 

Sequence 115 from Patent WO02061087. 

AX548830 

AX548830.1 GI:25813724

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Burmer,G.C., Roush,C.L. and Brown,J.P.

Antigenic peptides, such as for G protein-coupled receptors 
(GPCRs) , antibodies thereto, and systems for identifying such 
antigenic peptides 

Patent: WO 02061087-A 115 08-AUG-2002; 
Lifespan Biosciences, Inc. (US) 

Location/Qualifiers

1..4105

/organism="Homo sapiens"
/mol_type="genomic DNA"
/db_xref="taxon:9606"
1138 a 859 c 845 g 1263 t 



ORIGIN 



Query Match 100.0%; Score 4105; DB 6; Length 4105; 

Best Local Similarity 100.0%; Pred. No. 0;

Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Qy   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

Qy  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180

Qy  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240

Qy  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

Qy  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

Qy  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420

Qy  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

Qy  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

Qy 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

Qy 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620



Qy 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

Qy 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

Qy 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

Qy 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

Qy 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

Qy 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

Qy 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

Qy 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

Qy 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160

Qy 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

Qy 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

Qy 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340

Qy 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

Qy 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460



Qy 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

Qy 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

Qy 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

Qy 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

Qy 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

Qy 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

Qy 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

Qy 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

Qy 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000

Qy 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

Qy 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

Qy 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

Qy 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

Qy 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

Qy 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

Qy 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

Qy 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

Qy 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540

Qy 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600

Qy 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660

Qy 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

Qy 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

Qy 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

Qy 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

Qy 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960

Qy 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

Qy 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

Qy 4081 TAAAATAAAAGTTTACAGAAACCTT 4105
        |||||||||||||||||||||||||
Db 4081 TAAAATAAAAGTTTACAGAAACCTT 4105



RESULT 3 
AX587759 



LOCUS AX587759 4105 bp DNA linear PAT 10-JAN-2003 

DEFINITION Sequence 229 from Patent WO0246467. 
ACCESSION AX587759 

VERSION AX587759.1 GI:28212399
KEYWORDS
SOURCE synthetic construct
ORGANISM synthetic construct
artificial sequences.
REFERENCE 1
AUTHORS Bertucci,F., Houlgatte,R., Birnbaum,D., Nguyen,C., Viens,P. and
Fert,V.
TITLE Gene expression profiling of primary breast carcinomas using arrays
of candidate genes
JOURNAL Patent: WO 0246467-A 229 13-JUN-2002;
Ipsogen (FR)
FEATURES Location/Qualifiers
source 1..4105
/organism="synthetic construct"
/db_xref="taxon:32630"
/note="primer"
misc_feature 1..4105
/note="endothelin receptor type a (EDNRA) gene."
BASE COUNT 1138 a 859 c 845 g 1263 t

ORIGIN 

Query Match 100.0%; Score 4105; DB 6; Length 4105; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Qy   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

Qy  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180

Qy  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240

Qy  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

Qy  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

Qy  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420



Qy  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

Qy  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

Qy 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

Qy 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

Qy 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

Qy 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

Qy 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

Qy 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

Qy 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

Qy 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

Qy 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

Qy 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

Qy 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160



Qy 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

Qy 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

Qy 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340

Qy 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

Qy 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

Qy 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

Qy 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

Qy 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

Qy 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

Qy 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

Qy 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

Qy 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

Qy 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

Qy 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000



Qy 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

Qy 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

Qy 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

Qy 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

Qy 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

Qy 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

Qy 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

Qy 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

Qy 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540

Qy 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600

Qy 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660

Qy 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

Qy 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

Qy 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840



Qy 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3 900 

IIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

Db 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900 

Qy 3 901 ATTCAGAAAGT CATAGATTTCTGAAGGCGTCAACGTG CATTTTATTTATGGACTGGTAAG 3960 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIMIIIIIIIIIIMIIIIII 

Db 3 901 ATTCAGAAAGT CATAGATTTCTGAAGGCGTCAACGTG CATTTTATTTATGGACTGGTAAG 3960 

Qy 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4 020 

IIIIIIMIIMMIIIIIIMMIIIIMIIIIIIIIIIIMIMIIIIIIIIIIIMI 

Db 3 961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4 02 0 

Qy 4 021 AGTAACTTTGTAGAAATGAG CCAGAAGCCAAGG CCCTGAGTTGG CAGTGG CCCATAAGTG 4 080 

I II II II MM II I Mill II I II II II II II II II II I II I Ml I Mill II MM III 

Db 4 021 AGTAACTTTGTAGAAATGAG CCAGAAGC CAAGG CC CTGAGTTGG CAGTGG C CCATAAGTG 4 080 

Qy 4 081 TAAAATAAAAGTTTACAGAAACCTT 4105 

i: ii: ii :n i 

Db 4 081 TAAAATAAAAGTTTACAGAAACCTT 4105 
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E07649 4105 bp RNA linear PAT 29-SEP-1997 

cDNA encoding endothelin receptor, ETA-receptor . 

E07649 

E07649. 1 GI :2175784 
JP 1994157595-A/l. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 4105) 

Imura,H., Nakao,I. and Nakanishi , S . 

HUMAN ENDOTHELIN RECEPTOR 

Patent: JP 1994157595-A 1 03-JUN-1994; 

SHIONOGI 6c CO LTD 

OS Homo sapiens (human)
PN JP 1994157595-A/1
PD 03-JUN-1994
PF 12-JUL-1991 JP 1991172828
PI IMURA HIROO, NAKAO ICHIKAZU, NAKANISHI SHIGETADA
PC C07K13/00, C12N5/10, C12N15/12, C12P21/02, (C12N5/10, C12R1:91),
PC (C12P21/02, C12R1:91);
CC strandedness: Double;
CC topology: Linear;
CC hypothetical: No;
CC anti-sense: No;
FH Key Location/Qualifiers
FH
FT source 1..4105
FT /organism='Homo sapiens'
FT /tissue_type='placenta'
FT /clone='phETIR'
FT 5'UTR 1..484
FT CDS 485..1768
FT /product='endothelin receptor, ETA-receptor'
FT sig_peptide 485..544
FT /product='signal peptide of endothelin receptor, ETA-receptor'
FT mat_peptide 545..1765
FT /product='endothelin receptor, ETA-receptor'
FT 3'UTR 1769..4105
FT polyA_signal 4084..4089.
FEATURES Location/Qualifiers
source 1..4105
/organism="Homo sapiens"
/mol_type="genomic RNA"
/db_xref="taxon:9606"
BASE COUNT 1138 a 859 c 845 g 1263 t
ORIGIN

Query Match 100.0%; Score 4105; DB 6; Length 4105;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Qy   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

Qy  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180

Qy  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240

Qy  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

Qy  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

Qy  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420

Qy  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540



Qy  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

Qy 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

Qy 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

Qy 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

Qy 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

Qy 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

Qy 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

Qy 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

Qy 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

Qy 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

Qy 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

Qy 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160

Qy 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

Qy 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

Qy 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340

Qy 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

Qy 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

Qy 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

Qy 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580

Qy 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

Qy 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

Qy 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

Qy 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

Qy 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

Qy 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

Qy 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000

Qy 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

Qy 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

Qy 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

Qy 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

Qy 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

Qy 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

Qy 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420

Qy 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3421 AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

Qy 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3481 TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 3540

Qy 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3541 CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600

Qy 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3601 TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660

Qy 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

Qy 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

Qy 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

Qy 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

Qy 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960



Qy 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

Qy 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

Qy 4081 TAAAATAAAAGTTTACAGAAACCTT 4105
        |||||||||||||||||||||||||
Db 4081 TAAAATAAAAGTTTACAGAAACCTT 4105



RESULT 5 
HSET1R 

LOCUS HSET1R 4105 bp mRNA linear PRI 23-MAR-1993
DEFINITION H. sapiens mRNA for endothelin-1 receptor.
ACCESSION X61950
VERSION X61950.1 GI:288312
KEYWORDS endothelin-1 receptor.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4105)
AUTHORS Hosoda,K., Nakao,K., Hiroshi-Arai, Suga,S., Ogawa,Y., Mukoyama,M.,
Shirakami,G., Saito,Y., Nakanishi,S. and Imura,H.
TITLE Cloning and expression of human endothelin-1 receptor cDNA
JOURNAL FEBS Lett. 287 (1-2), 23-26 (1991)
MEDLINE 91348221
PUBMED 1652463
FEATURES Location/Qualifiers
source 1..4105
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
CDS 485..1768
/codon_start=1
/product="endothelin-1 receptor"
/protein_id="CAA43953.1"
/db_xref="GI:288313"
/db_xref="SWISS-PROT:P25101"
/translation="METLCLRASFWLALVGCVISDNPERYSTNLSNHVDDFTTFRGTE
LSFLVTTHQPTNLVLPSNGSMHNYCPQQTKITSAFKYINTVISCTIFIVGMVGNATLL
RIIYQNKCMRNGPNALIASLALGDLIYVVIDLPINVFKLLAGRWPFDHNDFGVFLCKL
FPFLQKSSVGITVLNLCALSVDRYRAVASWSRVQGIGIPLVTAIEIVSIWILSFILAI
PEAIGFVMVPFEYRGEQHKTCMLNATSKFMEFYQDVKDWWLFGFYFCMPLVCTAIFYT
LMTCEMLNRRNGSLRIALSEHLKQRREVAKTVFCLVVIFALCWFPLHLSRILKKTVYN
EMDKNRCELLSFLLLMDYIGINLATMNSCINPIALYFVSKKFKNCFQSCLCCCCYQSK
SLMTSVPMNGTSIQWKNHDQNNHNTDRSSHKDSMN"
BASE COUNT 1138 a 859 c 845 g 1263 t
ORIGIN

Query Match 100.0%; Score 4105; DB 9; Length 4105;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 4105; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 60

Qy   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 120

Qy  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 180

Qy  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 240

Qy  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300

Qy  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360

Qy  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420

Qy  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480

Qy  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540

Qy  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600

Qy  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660

Qy  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720

Qy  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780

Qy  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840

Qy  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900

Qy  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960

Qy  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020

Qy 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080

Qy 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140

Qy 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200

Qy 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260

Qy 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320

Qy 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380

Qy 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440

Qy 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500

Qy 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560

Qy 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620

Qy 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680

Qy 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740

Qy 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800

Qy 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860

Qy 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920

Qy 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 1980

Qy 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1981 TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2040

Qy 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2041 AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100

Qy 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2101 CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160

Qy 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2161 AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220

Qy 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2221 ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 2280

Qy 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2281 TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 2340

Qy 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2341 AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 2400

Qy 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2401 ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACA 2460

Qy 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2461 TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520

Qy 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2521 ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580



Qy 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2581 TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640

Qy 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2641 AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700

Qy 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2701 TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760

Qy 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820

Qy 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2821 CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880

Qy 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2881 TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 2940

Qy 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2941 GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000

Qy 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3001 TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3060

Qy 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3061 GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 3120

Qy 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3121 AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 3180

Qy 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3181 ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 3240

Qy 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3241 GGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATA 3300

Qy 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3301 GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360

Qy 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3361 TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420



Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy




Db 


4021 


Qy 


4081 


Db 


4081 



AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 34 8 0 

II MMMM II MM II II II Ml II Ml I II' Mill 

AGAGG CAGCGTGAAAGCAGATGAG CTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 34 80 
TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 

MINIMI I IIIHIIIIIIMIIIIIIMIMIIIIIIIIIIIIIIIIIIIII Mill 

TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 
CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 

MM III II MM MINIM I MM MM MINIMI MM INI I II II MINIM 

CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 
TGTCTGATATTTCTTTCAGACTT CG CCAGACAGATTG CTGATAATAAATTAGGTAAGATA 3660 

IIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIII 

TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 
ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720 

IIIIMMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIMIIIIIIMMIM 

ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720 
CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 378 0 

Mi: II II Ml II II II III II IM III III lh II 

CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 378 0 
CATCATATG CAGTGATATATG C CTATAATATAAG CCATAGGTTCACAC CATTTTGTTTAG 384 0 

II II IMM MM II M MMMI II II II II M 

CATCATATG CAGTGATATATG CCTATAATATAAG C CATAGGTTCACAC CATTTTGTTTAG 384 0 
ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900 

M IMM! MM III II, II III IMMM III MM II II 

ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3 900 
ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3 960 

IIIIIMIIIIIIIIIIIIIIIMIIMMIIMMIIIIIIIMMIIIIIIIIIIIII 

ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3 960 
TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4 020 

MIIMIIMIIIIIIIIMIIilllllllllllllllMIIMIIIIIIIIIMIIIII 

TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020
AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4 080 

MIMIIMIIIMIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIMMIM 

AGTAACTTTGTAGAAATGAG CCAGAAG CCAAGG CCCTGAGTTGG CAGTGG CCCATAAGTG 4 080 
TAAAATAAAAGTTTACAGAAACCTT 4105 

IIMNNNNNNNNINN 

TAAAATAAAAGTTTACAGAAACCTT 4105 



RESULT 6 
S57498 

LOCUS       S57498 4105 bp mRNA linear PRI 29-JAN-2002
DEFINITION  endothelin ET-A receptor [human, mRNA, 4105 nt].
ACCESSION   S57498
VERSION     S57498.1 GI:298319
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 4105)
  AUTHORS   Arai,H., Nakao,K., Hosoda,K., Ogawa,Y., Nakagawa,O., Komatsu,Y. and
            Imura,H.
  TITLE     Molecular cloning of human endothelin receptors and their
            expression in vascular endothelial cells and smooth muscle cells
  JOURNAL   Jpn. Circ. J. 56 Suppl 5, 1303-1307 (1992)
  MEDLINE   93180293
  PUBMED    1291713
  REMARK    GenBank staff at the National Library of Medicine created this
            entry [NCBI gibbsq 128422] from the original journal article.
            This sequence comes from Fig. 5.
FEATURES             Location/Qualifiers
     source          1..4105
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     CDS             485..1768
                     /note="This sequence comes from Fig. 5; conceptual
                     translation presented here differs from translation in
                     publication; ET-AR"
                     /codon_start=1
                     /product="endothelin ET-A receptor"
                     /protein_id="AAB25530.2"
                     /db_xref="GI:18390352"
                     /translation="METLCLRASFWLALVGCVISDNPERYSTNLSNHVDDFTTFRGTE
                     LSFLVTTHQPTNLVLPSNGSMHNYCPQQTKITSAFKYINTVISCTIFIVGMVGNATLL
                     RIIYQNKCMRNGPNALIASLALGDLIYVVIDLPINVFKLLAGRWPFDHNDFGVFLCKL
                     FPFLQKSSVGITVLNLCALSVDRYRAVASWSRVQGIGIPLVTAIEIVSIWILSFILAI
                     PEAIGFVMVPFEYRGEQHKTCMLNATSKFMEFYQDVKDWWLFGFYFCMPLVCTAIFYT
                     LMTCEMLNRXNGSLRIALSEHLKQRREVAKTVFCLVVIFALCWFPLHLSRILKKTVYN
                     EMDKNRCELLSFLLLMDYIGINLATMNSCINPIALYFVSKKFKNCFQSCLCCCCYQSK
                     SLMTSVPMNGTSIQWKNHDQNNHNTDRSSHKDSMN"
BASE COUNT  1138 a 859 c 843 g 1263 t 2 others
ORIGIN

Query Match 100.0%; Score 4103; DB 9; Length 4105;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 4103; Conservative 0; Mismatches 2; Indels 0; Gaps 0;
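The summary figures reported for this hit can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only. It assumes, without confirmation from the search software's documentation, that Query Match is the score divided by the perfect score (4105, from the search header) and that Best Local Similarity is matches divided by aligned columns. Under those assumptions it reproduces the printed percentages, including the rounding of 99.95% up to 100.0%.

```python
# Figures copied from the report. The formulas are assumptions that
# happen to reproduce the printed values; they are not taken from the
# search software's documentation.
PERFECT_SCORE = 4105  # "Perfect score" in the search header
score, matches, mismatches, indels = 4103, 4103, 2, 0
base_count = {"a": 1138, "c": 859, "g": 843, "t": 1263, "others": 2}


def percent(numerator, denominator):
    """Percentage rounded to one decimal place, as printed in the report."""
    return round(100.0 * numerator / denominator, 1)


aligned_columns = matches + mismatches + indels
query_match = percent(score, PERFECT_SCORE)
best_local_similarity = percent(matches, aligned_columns)
total_bases = sum(base_count.values())

print("Query Match           %.1f%%" % query_match)
print("Best Local Similarity %.1f%%" % best_local_similarity)
print("Aligned columns       %d" % aligned_columns)
print("BASE COUNT total      %d" % total_bases)
```

The base-count total and the number of aligned columns both come to 4105, the stated length of the entry, which is a quick way to confirm that the counts survived transcription.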



Qy 

Db 



1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 

MIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIMIIIIMIIIIIMII 

1 GAATTCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGG 



60 



60 



Qy 

Db 

Qy 
Db 

Qy 
Db 



6 1 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

6 1 AGACGGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCT 
121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 

I MM 1 1 II I MM II MM II II MM II MM Ml II II II II II II MINI II II I 

121 CGCGCGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTT 
181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA

1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f i 1 1 1 1 1 1 

181 CATCCATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAA 



120 



120 



180 



180 



240 



240 



Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961


Db 


961 


Qy 


1021 


Db 


1021 



GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 3 00 

MIIIIIIIIIMIIIMIIIIIIIIMIIIIIMIIIIIIIIIIIIIIMIIMIIMI 

GCAGTGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGC 300 
CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360 

II I IIIIIMI 1 1 III I II II Mill INI I MM Mill Mill INI I MM I Mill 

CGGAGCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTC 360 

TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 420 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I II I I I II I I I! I I I I I I I I 
TGGCCCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAA 42 0 

AAAAAGTGAAGGTGTAAAAGCAG CACAAGTG CAATAAGAGATATTTCCTCAAATTTG CCT 480 

I M 1 1 1 1 1 1 II II II 1 1 1 II 1 1 1 1 II I II 1 1 1 II 1 1 1 1 II II II 1 1 II 1 1 1 1 II I II I II 

AAAAAGTGAAGGTGTAAAAGCAG CACAAGTG CAATAAGAGATATTTCCTCAAATTTG CCT 48 0 
CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 54 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I' 

CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 54 0 
CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCA 600 

MIMIMIMMMMMMMMMMMMMMIMMMMMMMMMMM 

CAGTGATAATCCTGAGAGATAC^GC^CAAATCTAAGCAATC^TGTGGATGATTTCACCAC 600 
TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660 

IIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 

TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660 
ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGA 72 0 

M M I M M II 1 1 1 1 1 1 1 1 1 1 1 1 II II II I II II II 1 1 1 1 1 1 1 1 II 1 1 1 II I II II 1 1 1 1 

* ACCCAGCAATGGCTCAATGCACAACTATTGCCCA 72 0 

CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 78 0 

IMIIIIIIIMIIIMIIIIIIIIIMIMIIIIIIIIIIIIIIMMIMMMIIM 

CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780 
AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 84 0 

I II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II I II 1 1 1 1 1 1 1 II I II I II 1 1 1 

AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 84 0 
AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900 

MMMMMIMMIMMMMMMMMMMMMIMMMIMIMMIMM 

AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900 



I M M I II I II I M II 1 1 1 1 1 1 1 1 1 1 II I M 1 1 II 1 1 1 II I II 1 1 1 II II 1 1 1 1 1 M 1 1 1 



GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 102 0 

M II II 1 1 1 II II 1 1 1 1 II I II II 1 1 1 II 1 1 M II I II I II 1 1 1 M 1 1 1 1 1 1 1 M 1 1 II I 

GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 102 0 
TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080 

MMMMIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIMIMIIIMI 

TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 108 0 



Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 



TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 114 0 

II Mill II III MM I INI Mill MINIMI I MM Mill Mill III! Mill I 

TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 114 0 

TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 12 00 

IIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMM 

TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 12 00 

TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 12 60 

MMMMMMIMMMMMMMMMMMMMMMMIMIMMMIMM 

TATGCTCAATGCCACATGAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260 

CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320 

IIMIMIIIIIIIIIIMIIIIIIIIMIIIMIIMIIMIIIIIIIIIIIIIIIMI 

CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACAGCCTCATGAC 132 0 

TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380 

II ill II II I II MM MM f J 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i I M 1 1 1 1 1 1 1 E 1 1 1 1 1 

TTGTGAGATGTTGAACAGAANNAATGG CAGCTTGAGAATTGC C CTCAGTGAACATCTTAA 138 0 

GC^GCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 144 0 

1 1 M 1 1 1 1 II II II 1 1 II II I II 1 1 1 II II II II II 1 1 1 II II 1 1 1 II I II 1 1 II II I II 

GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 144 0 

GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II M ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r IF 1 1 

GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500 

CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 

I II 1 1 1 II 1 1 1 1 1 1 M I II I M I M I II 1 1 M 1 1 II I M 1 1 1 1 II II II II 1 1 1 1 1 M M 

CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560 

CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 162 0 

I M I II II I II II I II I II I II I M II M II I I I II I II I I II || M II II II II I I M I 
CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 162 0 

TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 168 0 

I I M 1 1 1 1 II I II II M II I II 1 1 1 M II I II II 1 1 II 1 1 II I II II II II I II I II I II 

TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680 

C^TGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 174 0 

I M I M M I M II 1 1 1 II I II II II I II II 1 1 1 II I II I II II I II II II II II 1 1 1 1 M 

CATGAACGGAACAAGC^TCC^GTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 174 0 

GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 18 00 

MMIIIIIMIIIIIIIIIIIMIIIMIMMIIIII.IIIIIIIMIMIIMIIMI 

GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 18 00 

AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 

MIMIIIIIIIIIIIIIIIMIIIMMIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 

AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860 

CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 1920 

M I M II I M 1 1 II 1 1 II I II 1 1 II M I II II II I II 1 1 1 1 II I II 1 1 1 1 II II 1 1 1 1 1 1 

CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGA 192 0 

1921 CTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATGTAATTTACATA 1980 



Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


" 2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 



IM II; Ml II Ml , II Ml III Ml Ml Ml II MIMII 

CTGGTTTATCC7VCCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATA 198 0 
TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2 04 0 

MMIIIIIMIIIIIIIIIIIIIIMIIIIMIMIMIIIIIIMIIMMIIIIMI 

TTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTT 2 04 0 

AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100 

M. M:l II Ml II .Ml II II. Ih II II i ll M I 

AAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGG 2100 

CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 

I M . II Ml Mi MM II II II il II MM IMM 

CTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAG 2160 
AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220 

M Ml Ml IMMI II II II MM M! IMM 1 1 Ml I! 

AGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACA 2220 
ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 22 8 0 

Ml MM IMMM II Ml MM IM IM II III Ml MIMM 

ATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCA 22 80 
TAATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA 234 0 

IIIIIIIMIIIIIIIIIIIIIIIIMIMIMMIIIMMIIIIIMIIIMMIIII 

TAATAGCCTAACATGATTATTTGAACTTATTTACAC^TAGTTTGAAAAAAAAAAGAa\AA 234 0 
AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 24 00 

IIIMIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII! 

AATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAA 24 00 
ACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACA 2460 

M Ml MM II II , II IM Ml II MIMIII IM II Ml 

ACACAAATTCTAAAGCTACAACAAATACTAG^GGCCCTTA 2460 
TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520 

IIIIIMIIMMIIMIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIMI 

TTTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAA 2520 
ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580 

IIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIMIIIMIIIMIIIIIMIII 

ATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGC 2580 
TTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640 

MM M M MM MMIMM MM IMM M M IMIMM I MM MM MM I Ml 

TTO^TCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGA 2640 
AAAATC^TGTO\AGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 27 00 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMMIIIMIIIIIIIIIMI 

AAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTT 2700 
TCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760 

IIIIIIIIIIMMMIIMIIIIIIIIIMIIIIIIIIIIIIMIIIIUIIIIIIIII 

TCAGTGC^CTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAG 2760 
ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 2820 

Mill IIIIIIIIMIiMIIIIIIIII'llllllllllllll lllllillllll | 



Db 



2761 ATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATAT 282 0 



Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 



2880 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCA 2880 
TTCCCTTT.TCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 294 0 

MIIIII'IIIIIIMIIIIMIIIIIMIMIIIIMMIIIIIIIIMIIIIIMIIII 

TTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGT 294 0 
GAAAAATAATTACCCACAAATGCCA.CCAGTAACTTAACGATTCTTC^CTTCTTGGGGTTT 3000 

II il II! IN II I II III III MM II Mill I II 

GAAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTT 3000 
TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAG 3 060 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

TCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCAGATTGTCACCATTTa^AAG 3 060 
GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 312 0 

1 1 1 1 .ii 1 1 1 1 1 1 1 1 1 ; 1 1 ii i . 1 1 1 M iii 1 1 1 1 1 ii 1 1 1 1 1 1 in 1 1 1 1 1 ii i 1 1 

GGCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAG 312 0 
AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 318 0 

Ml IMMM IM II M|| Ml IM IIMMI Ml Mm I 

AAAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCC 318 0 
ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

ATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGAT 324 0 
GGTATGTATGGATTTAATCTAATCTAATAATTGTGC CCCG CAGTTGTG CCAAAGTG CATA 33 00 

Ml : II MM MM M II MM: IMMM Ml II Ml 1 II 

GGTATGTATGGATTTAATCTAATCTAATAATTGTGCC C CG CAGTTGTGCCAAAGTG CATA 3300 
GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 

I Ml MM II MM II III Ml MM M;, II M Ml 

GTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACC 3360 
TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 3420 

MM IM III M MM !l III Ml IM II 'ill Ml Ml 

TGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATA 342 0 
AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480

I lllllllll Mill MINIMI MM IIIIIIIIIMIII INI I MINIM Mill 

AGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGT 3480 
TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 

Ml III Mill II MM II III II II: IMMM IM M 

TGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTA 354 0 
CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGT 3600 

TGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 

I I I I i I I I I I I II I I I II I II I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I M | || 
TGTCTGATATTT CTTTCAGACTT CG CCAGACAGATTGCTGATAATAAATTAGGTAAGATA 3660 



Qy 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3661 ATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTG 3720

Qy 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3721 CAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCG 3780

Qy 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3781 CATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAG 3840

Qy 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3841 ACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAA 3900

Qy 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3901 ATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAG 3960

Qy 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3961 TAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACA 4020

Qy 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4021 AGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTG 4080

Qy 4081 TAAAATAAAAGTTTACAGAAACCTT 4105
        |||||||||||||||||||||||||
Db 4081 TAAAATAAAAGTTTACAGAAACCTT 4105
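The CDS annotated for S57498 starts at position 485, which falls inside the alignment block covering positions 481 to 540. As a rough check that the annotation and the aligned sequence agree, the sketch below translates that block from position 485 onward using the standard genetic code. The function and variable names are illustrative and not part of the report. The block string is copied from the alignment above with OCR spacing removed.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons; codons with ambiguous bases give 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# Alignment block for positions 481..540 of the query.
block_start = 481
block = "CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT"

cds_start = 485  # from the CDS feature 485..1768
fragment = block[cds_start - block_start:]
print(translate(fragment))
```

The printed peptide should match the first residues of the /translation qualifier in the record (METLCLRASFWLALVGCV...), which supports reading the CDS coordinates as 1-based and inclusive.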
FEATURES             Location/Qualifiers
     source          1..4101
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /tissue_type="placenta"
     CDS             481..1764
                     /codon_start=1
                     /product="endothelin-1 receptor"
                     /protein_id="BAA14359.1"
                     /db_xref="GI:219650"
                     /translation="METLCLRASFWLALVGCVISDNPERYSTNLSNHVDDFTTFRGTE
                     LSFLVTTHQPTNLVLPSNGSMHNYCPQQTKITSAFKYINTVISCTIFIVGMVGNATLL
                     RIIYQNKCMRNGPNALIASLALGDLIYVVIDLPINVFKLLAGRWPFDHNDFGVFLCKL
                     FPFLQKSSVGITVLNLCALSVDRYRAVASWSRVQGIGIPLVTAIEIVSIWILSFILAI
                     PEAIGFVMVPFEYRGEQHKTCMLNATSKFMEFYQDVKDWWLFGFYFCMPLVCTAIFYT
                     LMTCEMLNRRNGSLRIALSEHLKQRREVAKTVFCLVVIFALCWFPLHLSRILKKTVYN
                     EMDKNRCELLSFLLLMDYIGINLATMNSCINPIALYFVSKKFKNCFQSCLCCCCYQSK
                     SLMTSVPMNGTSIQWKNHDQNNHNTDRSSHKDSMN"
BASE COUNT  1136 a 859 c 844 g 1262 t
ORIGIN

Query Match 99.9%; Score 4101; DB 9; Length 4101;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 4101; Conservative 0; Mismatches 0; Indels 0; Gaps 0;
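This database entry is 4101 nt long against a 4105 nt query, and the first alignment block ends at 64 on the query row and 60 on the database row. With no indels reported, that suggests a constant offset of 4 between the two coordinate systems. The sketch below, with hypothetical helper names, maps the CDS coordinates of this entry (481..1764) onto the query on that assumption.

```python
# Offset read off the first alignment block (query end 64, database end 60).
# A single constant offset is only valid because the report lists
# Indels 0 and Gaps 0 for this hit.
OFFSET = 64 - 60


def db_to_query(pos):
    """Map a 1-based database position to the query coordinate."""
    return pos + OFFSET


def query_to_db(pos):
    """Map a 1-based query position back; positions 1..4 are unaligned."""
    db_pos = pos - OFFSET
    if db_pos < 1:
        raise ValueError("query position %d lies before the aligned region" % pos)
    return db_pos


cds_db = (481, 1764)
cds_query = tuple(db_to_query(p) for p in cds_db)
cds_length = cds_db[1] - cds_db[0] + 1

print("CDS on query:", cds_query)
print("CDS length:", cds_length, "nt =", cds_length // 3, "codons including the stop")
```

On this reading the CDS lands at 485..1768 on the query, the same interval annotated for the S57498 entry, and its length of 1284 nt is a whole number of codons.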

TCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGGAGAC 6 4 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGGAGAC 6 0 
GGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCTCGCG 124 

Ml : II MM II MM II MM II I III II III IliMI 

GGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCTCGCG 12 0 
CGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATC 184 

M ' II MM II M II III Ml II II MMIM II 

CGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATC 180 
CATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAG 244 

I M Ml II MM I II II MM Ml II II II III II II II I II II II MM II I II II Mi 

CATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAG 24 0 
TGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGA 3 04 

MMMMMMIMMMMMMMMMMMMMMMMIMMMMIIMM 

TG C CCAGGAAGTTTT CTGAAG CCGGGGAAG CTGTG CAG CCGAAG C CGC CG CCG CG C CGGA 300 
GCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGC 364 

II M 1 1 1 M 1 1 1 II 1 1 1 II M 1 1 M I II II 1 1 1 II I M M II I II 1 1 M 1 1 1 1 1 II 1 1 1 1 

GCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGC 360 

CCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAA 424 

M I I I I M M II II I I II I I II I II II I I I I I II I I II I II I I II I I I I I II II II I I II 
CCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAA 42 0 
185


Db 


181 


Qy 
Db
Db
Db
Qy


545 


Db 


541 


Qy 


605 


Db 


601 


Qy 


665 


Db 


661 


Qy 


725 


Db 


721 


Qy 


785 


Db 


781 


Qy 


845 


Db 


841 


Qy 


905 


Db 


901 


Qy 


965 


Db 


961 


Qy 


1025 


Db 


1021 


Qy 


1085 


Db 


1081 


Qy 


1145 


Db 


1141 


Qy 


1205 


Db 


1201 



AGTGAAGGTGTAAAAGCAGC^C^GTGC^TAAGAGATATTTCCTCAAATTTGCCTC^AG 4 84 

MM Mill MM II MINI I II II I III INI I MM MM MM MINIMI II II 

AGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAG 4 8 0 
ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 544 

IIIMIIIIIMIMIIIIIMIIIMIIIIIIIMIIIIIIIIIIIIIIMIIMIIII 

ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 54 0 
GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 604 

MIIIIIMIIIIIIIIIIIIIIMIIIIMMIIIIIMIIIIIIIIIIIIIMIIIII 

GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 600 
CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 664 

IIMIIIIIIIMIIIIMIIIIIIMIIIIIIIIMIIMIIIMIIIIIIIIIIIIII 

CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 660 
AGC^TGGCTC^TGC^C^ACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAA 724 

IMIIIIIIMIIIIIIIIIMIIIIMIMIIMIIIIIIIIIIIMIIIIIIMIIII 

AGCAATGGCTCAATGCAC^CTATTGCCC^CAGCAGACTAAAATTACTTCAGCTTTCAAA 72 0 
TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 784 

M Ml Ml II MM II II III II IMMII II IM : I 

TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 780 
CTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCC 844 

MIMIIIIIIMMIMIIIIIIIMIIIIIIMIIMMIIIIIIIIIIIIIMIIII 

CTG CTCAGGAT CATTTACCAGAACAAATGTATGAGGAATGG CCC CAACG CGCTGATAG CC 84 0 
AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 904 

IIMIIMIIIIIIIMIIIIMIIIIMIIIIIIIIMMIIIIIIIMIIIIIIMII 

AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 9 0 0 

CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 964 

MMIINMIIMIIMIIIIIIIMIIIIMIIMIIIMIMIIIMIIMMIMI 

CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 960 

TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 1024 

IMM MM II MM III 1 1 II 1 1 II II Ml I Ml || || || || | MM || MM Ml || | 

TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 102 0 

GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 1084 

I M I M I I I I I II M II II II II II I || || | | || | || | || | | | | | || || | M II II I II I 
GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 108 0 

GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 1144 

IMMMMMMMMMMMMMIMIMMMMMMMMMMMMIMM 

GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 114 0 
GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATG 12 04 

M I M M 1 1 II II 1 1 II I II I II 1 1 1 II I M 1 1 1 M I II II I M II II II 1 1 II 1 1 1 II I 

GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATG 12 00 
CTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTC 1264 

MM I II II III MM II II I MM I Ml MM 1 1 Ml MM I 

CTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTC 1260 



Qy 1265 GGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGT 1324 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1261 GGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGT 132 0 

Qy 132 5 GAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAG 1384 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1321 GAGATGTTGAACAGAAGGAATGG CAG CTTGAGAATTG CC CTCAGTGAACATCTTAAG CAG 1380 

Qy 1385 CGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTC 1444 

MIIMIMIIMI llllllllllllll MMIIIIllllllllMIIIIIIMililM 

Db 13 81 CGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTC 144 0 

Qy 144 5 C CTCTTCACTTAAG CCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAAC CGA 1504 

MIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIMIIIIIIMIIMII 

Db 1441 C CTCTTCACTTAAG CCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGA 1500 

Qy 1505 TGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATG 1564 

IIIIIIIIIIMIIIIIIIIIMIIIIIMIIIIIIIIIIMMMIIIIIIIIIMMI 

Db 1501 TGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATG 1560 

Qy 1565 AATTC^TGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTC 1624 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

Db .1561 AATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTC 162 0 

Qy 162 5 CAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATG 1684 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiin 

Db 1621 CAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATG 168 0 

Qy 1685 AACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACC^ 1744 

IMIIIMIMIMIIIIIIIIMMMIIIIIIIIIIIJMIMMIIIIIIIIIIIM 

Db 1681 AACGGAACAAGCATCC^GTGGAAGAACCACGATCAAAACAACC^ 174 0 

Qy 174 5 AGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATC 18 04 

IMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIMIIIIIIIIII 

Db 1741 AGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATC 18 00 

Qy 18 05 CTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTC 1864 

II IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII MINIMI lllllllllll 

Db . 18 01 CTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTC 1860 

Qy 18 65 TTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGC^ 1924 

MMIMMIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIMIMMMII 

Db 18 61 TTCCTTAATTCACTCCCACACCC^GAAGAAATGCTTTCC^W^CCGa^GGTAGACTGG 192 0 

Qy 1925 TTTATCCACCCAC^ACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCT 1984 

1 1 M 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 ! . 

Db 1921 TTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCT 1980 

Qy 1985 G CGTGTTGTATT CAG CACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAAT 2 044 

I I M M I I I I I I II I I I i I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I M I I 
Db 1981 GCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAAT 204 0 

Qy 2 04 5 GAAACCAGAAGGATATTTACTACTTTTG CATGAAAATAGAG CTTTCAAGTACATGG CTAG 2104 

IMIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIMIIIIIIIIIIIMIIII 

Db 2 041 GAAACCAGAAGGATATTTACTACTTTTGC^VTGAAAATAGAGCTTTCAAGTACATGGCTAG 2100 

Qy 2105 CTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGAT 2164 
2645
Qy


2705 


Db 


2701 


Qy 


2765 


Db 


2761 


Qy 
Db
Qy
Db


2881 


Qy 


2945 



IIIIIIIIIIIIMIIIIMIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGAT 2160 
TAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATAT 2224 

I Ml MM IM M III MM II IM III MMMIM Ml 

TAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATAT 222 0 
GGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAAT 2284 

Mill MM MM II III II III IM III M IMMI 

GGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAAT 2280 
AGCCTAACA.TGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAATA 2344 

IIIMMIIIIIIIIIIIIIIIIIIIIIIMIIIMIIMIIIIIIMIIIIMIIIIII 

AGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAATA 234 0 
GTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACAC 24 04 

I II M! MM MM MMMI ,11 ,MI MMMI ,MI ill 

GTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACAC 24 00 
AAATTCTAAAGCTACAACAAATACTACAGGCGCTTAAAGCACAGTCTGATGACACATTTG 2464 

M II III II M> MMMI M MM Ml IMM MM II 

AAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTTG 2460 
GCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGG 2524 

M II MMMI M MM IM I! IM Ml Ml Ml 1 II 

GCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGG 252 0 
TGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCA 2584 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCA 258 0 



||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
ATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAAA

TCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCAG
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
TCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCAG

TGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATGA
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
TGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATGA

TTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCACC
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
TTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCACC

TCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCC

CTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAA
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
CTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAA

AATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCAG
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



2880 



Db 



2941 AATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCAG 3000
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3245 
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3241 


Qy  3005 TATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGCC 3064
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3001 TATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGCC 3060

Qy  3065 CACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAA 3124
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3061 CACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAA 3120

Qy  3125 TCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTT 3184
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3121 TCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTT 3180

Qy  3185 TTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGTA 3244
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3181 TTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGTA 3240

Qy  3245 TGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCT 3304
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3241 TGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCT 3300

Qy  3305 GAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTA 3364
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3301 GAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTA 3360

Qy  3365 GCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAG 3424
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3361 GCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAG 3420

Qy  3425 GCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGT 3484
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3421 GCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGT 3480

Qy  3485 TTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACT 3544
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3481 TTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACT 3540

Qy  3545 TTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTC 3604
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3541 TTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTC 3600

Qy  3605 TGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTT 3664
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3601 TGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTT 3660

Qy  3665 GTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAG 3724
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3661 GTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAG 3720

Qy  3725 GCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATC 3784
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3721 GCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATC 3780

Qy  3785 ATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACAA 3844
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3781 ATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACAA 3840



Qy  3845 TTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTC 3904
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3841 TTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTC 3900

Qy  3905 AGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAAC 3964
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3901 AGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAAC 3960

Qy  3965 TGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTA 4024
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3961 TGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTA 4020

Qy  4025 ACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAAA 4084
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  4021 ACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAAA 4080

Qy  4085 ATAAAAGTTTACAGAAACCTT 4105
         |||||||||||||||||||||
Db  4081 ATAAAAGTTTACAGAAACCTT 4101
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AR270694 4079 bp DNA linear PAT 10-APR-2003 

Sequence 1257 from patent US 6500938. 

AR270694 

AR270694.1 GI:29701928 

Unknown.

Unknown.

Unclassified. 

1 (bases 1 to 4079)

Au-Young,J. and Seilhamer,J.J.

Composition for the detection of signaling pathway gene expression 
Patent: US 6500938-A 1257 31-DEC-2002; 

Location/Qualifiers 

1..4079

/organism="unknown"
1125 a 856 c 842 g 1256 t 



Query Match 99.4%; Score 4079; DB 6; Length 4079;

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4079; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy     5 TCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGGAGAC 64
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     1 TCGCGGCCGCCTCTTGCGGTCCCAGAGTGGAGTGGAAGGTCTGGAGCTTTGGGAGGAGAC 60

Qy    65 GGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCTCGCG 124
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    61 GGGGAGGACAGACTGGAGGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCTCGCG 120

Qy   125 CGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATC 184
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   121 CGCGCGTACAGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATC 180



Qy   185 CATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAG 244
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   181 CATCCCACCCGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAG 240

Qy   245 TGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGA 304
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   241 TGCCCAGGAAGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGA 300

Qy   305 GCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGC 364
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   301 GCCCGGGACACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGC 360

Qy   365 CCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAA 424
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   361 CCAGGCGCCGCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAA 420

Qy   425 AGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAG 484
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   421 AGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAG 480

Qy   485 ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 544
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   481 ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 540

Qy   545 GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 604
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   541 GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 600

Qy   605 CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 664
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   601 CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 660

Qy   665 AGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAA 724
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   661 AGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAA 720

Qy   725 TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 784
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   721 TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 780

Qy   785 CTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCC 844
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   781 CTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCC 840

Qy   845 AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 904
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   841 AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 900

Qy   905 CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 964
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   901 CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 960

Qy   965 TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 1024
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   961 TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 1020



Qy  1025 GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 1084
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1021 GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 1080

Qy  1085 GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 1144
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1081 GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 1140

Qy  1145 GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATG 1204
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1141 GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATG 1200

Qy  1205 CTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTC 1264
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1201 CTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTC 1260

Qy  1265 GGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGT 1324
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1261 GGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGT 1320

Qy  1325 GAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAG 1384
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1321 GAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAG 1380

Qy  1385 CGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTC 1444
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1381 CGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTC 1440

Qy  1445 CCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGA 1504
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1441 CCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGA 1500

Qy  1505 TGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATG 1564
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1501 TGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATG 1560

Qy  1565 AATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTC 1624
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1561 AATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTC 1620

Qy  1625 CAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATG 1684
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1621 CAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATG 1680

Qy  1685 AACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAGC 1744
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1681 AACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAGC 1740

Qy  1745 AGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATC 1804
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1741 AGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATC 1800

Qy  1805 CTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTC 1864
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1801 CTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTC 1860



Qy  1865 TTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTGG 1924
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1861 TTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTGG 1920

Qy  1925 TTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCT 1984
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1921 TTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCT 1980

Qy  1985 GCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAAT 2044
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1981 GCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAAT 2040

Qy  2045 GAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAG 2104
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2041 GAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAG 2100

Qy  2105 CTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGAT 2164
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2101 CTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGAT 2160

Qy  2165 TAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATAT 2224
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2161 TAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATAT 2220

Qy  2225 GGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAAT 2284
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2221 GGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAAT 2280

Qy  2285 AGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAATA 2344
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2281 AGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAATA 2340

Qy  2345 GTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACAC 2404
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2341 GTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACAC 2400

Qy  2405 AAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTTG 2464
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2401 AAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTTG 2460

Qy  2465 GCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGG 2524
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2461 GCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGG 2520

Qy  2525 TGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCA 2584
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2521 TGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCA 2580

Qy  2585 ATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAAA 2644
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2581 ATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAAA 2640

Qy  2645 TCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCAG 2704
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2641 TCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCAG 2700

Qy  2705 TGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATGA 2764
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2701 TGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATGA 2760



Qy  2765 TTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCACC 2824
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2761 TTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCACC 2820

Qy  2825 TCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCC 2884
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2821 TCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCC 2880

Qy  2885 CTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAA 2944
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2881 CTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAA 2940

Qy  2945 AATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCAG 3004
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2941 AATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCAG 3000

Qy  3005 TATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGCC 3064
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3001 TATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGCC 3060

Qy  3065 CACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAA 3124
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3061 CACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAA 3120

Qy  3125 TCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTT 3184
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3121 TCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTT 3180

Qy  3185 TTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGTA 3244
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3181 TTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGTA 3240

Qy  3245 TGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCT 3304
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3241 TGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCT 3300

Qy  3305 GAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTA 3364
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3301 GAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTA 3360

Qy  3365 GCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAG 3424
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3361 GCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAG 3420

Qy  3425 GCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGT 3484
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3421 GCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGT 3480

Qy  3485 TTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACT 3544
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3481 TTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACT 3540

Qy  3545 TTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTC 3604
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3541 TTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTC 3600



Qy  3605 TGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTT 3664
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3601 TGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTT 3660

Qy  3665 GTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAG 3724
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3661 GTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAG 3720

Qy  3725 GCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATC 3784
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3721 GCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATC 3780

Qy  3785 ATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACAA 3844
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3781 ATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACAA 3840

Qy  3845 TTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTC 3904
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3841 TTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTC 3900

Qy  3905 AGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAAC 3964
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3901 AGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAAC 3960

Qy  3965 TGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTA 4024
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3961 TGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTA 4020

Qy  4025 ACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4083
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  4021 ACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4079
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TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REMARK 



FEATURES 

source 



S45956 3305 bp mRNA linear PRI 08-MAY-1993 

endothelin receptor subtype A [human, placenta, mRNA, 3305 nt] . 
S45956 

S45956.1 GI:257375 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 3305)

Hayzer,D.J., Rose,P.M., Lynch,J.S., Webb,M.L., Kienzle,B.K.,

Liu,E.C., Bogosian,E.A., Brinson,E. and Runge,M.S.

Cloning and expression of a human endothelin receptor: subtype A

Am. J. Med. Sci. 304 (4), 231-238 (1992)

93035452 

1415318 

GenBank staff at the National Library of Medicine created this 
entry [NCBI gibbsq 115436] from the original journal article. 
This sequence comes from Fig. 2. 

Location/Qualifiers

1..3305



/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606"
gene 1..3305

/gene="endothelin receptor subtype A, ET-R [A]"
CDS 222..1505

/gene="endothelin receptor subtype A, ET-R [A]" 

/note="This sequence comes from Fig. 2; ET-R [A]"

/codon_start=1

/product="endothelin receptor subtype A"
/protein_id="AAB23644.1"
/db_xref="GI:257376"

/translation="METLCLRASFWLALVGCVISDNPERYSTNLSNHVDDFTTFRGTE
LSFLVTTHQPTNLVLPSNGSMHNYCPQQTKITSAFKYINTVISCTIFIVGMVGNATLL
RIIYQNKYMRNGPNALIASLALGDLIYVVIDLPINVFKLLAGRWPFDHNDFGVFLCKL
FPFLQKSSVGITVLNLCALSVDRYRAVASWSRVQGIGIPLVTAIEIVSIWILSFILAI
PEAIGFVMVPFEYRGEQHKTCMLNATSKFMEFYQDVKDWWLFGFYFCMPLVCTAIFYT
LMTCEMLNRRNGSLRIALSEHLKQRREVAKTVFCLVVIFALCWFPLHLSRILKKTVYN
EMDKNRCELLSFLLLMDYIGINLATMNSCINPIALYFVSKKFKNCFQSCLCCCCYQSK
SLMTSVPMNGTSIQWKNHDQNNHNTDRSSHKDSMN"

BASE COUNT 932 a 702 c 654 g 1013 t 4 others 

ORIGIN 



Query Match 77.5%; Score 3183.2; DB 9; Length 3305; 

Best Local Similarity 98.9%; Pred. No. 0; 

Matches 3275; Conservative 0; Mismatches 27; Indels 8; Gaps 7; 
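
The summary figures in this listing appear to be related by simple ratios. The sketch below is inferred only from the numbers printed here, not from the search program's documentation: it assumes "Query Match" is the hit score divided by the perfect score (4105 for this query), and "Best Local Similarity" is matches divided by the number of alignment columns (matches + mismatches + indels). The function names are illustrative only.

```python
# Assumed relations, checked only against the values printed in this listing.

PERFECT_SCORE = 4105  # "Perfect score" reported for query US-09-931-157-1


def query_match_pct(score, perfect_score=PERFECT_SCORE):
    """Hit score as a percentage of the query's perfect score."""
    return 100.0 * score / perfect_score


def local_similarity_pct(matches, mismatches, indels):
    """Matches as a percentage of all alignment columns."""
    columns = matches + mismatches + indels
    return 100.0 * matches / columns


# S45956: Score 3183.2; Matches 3275; Mismatches 27; Indels 8
print(round(query_match_pct(3183.2), 1))             # 77.5
print(round(local_similarity_pct(3275, 27, 8), 1))   # 98.9

# AR270694: Score 4079; Matches 4079; Mismatches 0; Indels 0
print(round(query_match_pct(4079), 1))               # 99.4
print(round(local_similarity_pct(4079, 0, 0), 1))    # 100.0
```

Under these assumptions the computed values agree with the percentages reported for both hits.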



Qy   263 AAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGAGCCCGGGACACCGGCCAC 322
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     1 AAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGAGCCCGGGACACCGGCCAC 60

Qy   323 CCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGCCCAGGCGCCGCGCGGACC 382
         ||||||||||||||||||||||   |||||||||||||||||||||||||||||||||||
Db    61 CCTCCGCGCCACCCACCCTCGCCGGCTCCGGCTTCCTCTGGCCCAGGCGCCGCGCGGACC 120

Qy   383 CGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAAAGTGAAGGTGTAAAAGCA 442
         |||||||||||||||||||||||||||||||||| |||||||||||||||||||||||||
Db   121 CGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTG-AAAAAAAAGTGAAGGTGTAAAAGCA 179

Qy   443 GCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAGATGGAAACCCTTTGCCTC 502
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   180 GCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAGATGGAAACCCTTTGCCTC 239

Qy   503 AGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGTGATAATCCTGAGAGATAC 562
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   240 AGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGTGATAATCCTGAGAGATAC 299


Qy 


563 


AGCACAAATCTAAGCAATCATGTGGATGATTTCACCACT^ 

III MINIMI Mil MINIM MM Mil Mill INI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

AGCA.CAAATCTAAGCAATCATGTGGATGATTTCACCACTTTTCGTGGCA 


622 


Db 


300 


359 


Qy 


623 


TTCCTGGTTACCACTCATCAACCCACTAA 

1 1 1 1 M M 1 II II 1 II 1 II M II II II 1 M 1 1 1 M 1 II 1 1 M 1 1 M II II II II 1 M M 

TTCCTGGTNACCACTCATCAACCCACTAATTTGGTCCTACCC^ 


682 


Db 


360 


419 


Qy 


683 


AACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATT 

M 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 ! 1 1 1 1 1 1 II Mill Mill MMM Mill III 


742 



Db 



42 0 AACTATTGCCCACAGCAGACTAAAAT^ 479 



Qy 


743 


Db 


480 


Qy 


803 


Db 


540 


Qy 


863 


Db 


600 


Qy 


923 


Db 


660 


Qy 


983 


Db 


720 


Qy 


1043 


Db 


780 


Qy 


1103 


Db 


840 


Qy 


1163 


Db 


900 


Qy 


1223 


Db 


960 


Qy 


1283 


Db 


1020 


Qy 


1343 


Db 


1080 


Qy 


1403 


nb 


± -L rr KJ 


Qy 


1463 


Db 


1200 


Qy 


1523 


Db 


1260 



TCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTAC 8 02 

I! ' I! II 1 'II It!! II III; II 'III: II 'III. II III 

TCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTAC 53 9 
CAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGAC 862 

I II 1 1 Ml II IMIIIIIMIMIIIMHMIIIIIIIIIIIIIIIIIIIIMIIIII 

CAGAACAAATATATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGAC 599 
CTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGG 922 

IIIIIMIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIII 

CTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGG 659 
CCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAG 982 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAG 719 
TCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCA 1042 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIUMIIIIIIIIII 

TCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCA 779 
GTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATT 1102 

IIIIIIMIIIIIIIIIIIIMMIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIII 

GTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATT 83 9 
GTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATG 1162 

IIIMMIMMMIMMMMMMMMMMMMIIMMMMMIMMIMI 

GTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATG 899 
GTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCCACATCAAAA 1222 

MIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII.MIIIIIIM 

GTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCCACATCAAAA 959 
TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1019 
CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342 

I INN MM MM I MM lllllllll Mill MM I III 1 1 III I MM INI Mill 

CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 107 9 
AATGG CAGCTTGAGAATTG CCCTCAGTGAACATCTTAAG CAG CGTCGAGAAGTGG CAAAA 14 02 

IMMMMIMIMMMIIMMMMMIMMMMIMMMMIIMMIMM 

AATGG CAG CTTGAGAATTGCC CTCAGTGAACATCTTAAG CAG CGTCGAGAAGTGG CAAAA 1139 
ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 14 62 

IIMIIIIMIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIII lllllllll 

ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCATTTAAGCCGT 1199 
ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522 

IIIIIIIIIIMIIIIIIIIIIIIII lllllllllllllllllllllllllllllllll 

ATATTGAAGAAAACTGTGTATAACGAGATGGACAAGAACCGATGTGAATTACTTAGTTTC 125 9 
TTACTGCTC^TGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582 

I I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n l j 1 1 1 1 1 1 1 

TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1319 



Qy 


1583 


Db 


1320 


Qy 


1643 


Db 


1380 


Qy 


1703 


Db 


1440 


Qy 


1763 


Db 


1500 


Qy 


1823 


Db 


1560 


Qy 


1883 


Db 


1620 


Qy 


1942 


Db 


1680 


Qy 


2002 


Db 


1740 


Qy 


2062 


Db 


1800 


Qy 


2122 


Db 


1860 


Qy 


2182 


Db 


1920 


Qy 


2241 


Db 


1980 


Qy 


2301 


Db 


2040 


Qy 


2361 


Db 


2100 



ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642 

MMMIMMIMMIIMIMMMMMMIIIMIMMMIMIMMMIMM 

ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 137 9 
TGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATGAACGGAACAAGCATCCAG 1702 

ill II :M III Ml "II III IMIIII II Ml: Ml II 

TGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATGAACGGAACAAGCATCCAG 1439 
TGGAAGAACCACGAT CAAAACAAC CA CAA CA CAGA C CGGAG CAG C CATAAGGA CAG CATG 1762 

1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 

TGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAGCAGCCATAAGGACAGCATG 14 99 
AACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATCCTCTCGGAGAAAAAAATC 1822 

MM Mill! II 1 1 II III II I IIIIIIIIMI IMIIII II IMIIII II Mill I III 

AACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATCCTCTCGGAGAAAAAAATC 1559 
ACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAATTCACTCCCA 1882 

I MM I INI III 1 1 III MM IIIIIIIIMIIIIIIIIIIIIIII IMIIII 1 1 III 

ACAAGGCAACTGTGAGTCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAATTCACTCCCA 1619 
CACCCAAGAAGAAATGCTTTCCAAAACCGCAA - GGTAGACTGGTTTATCCACCCACAACA 1941 

IIIMIIIIIIIIIIIIIIMIIIIIIIMII IMIIII lllllllllll Mill MM 

CACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGGTAGACTGGTTTATCCACCCACAACA 1679 
TCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTTGTATTCAGCA 2 001 

Ml 1 1 Mill III II II Mill 1 1 II MMMI MM III II MMMI II MM II I II . 

TCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTTGTATTCAGCA 173 9 

CTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCAGAAGGATATT 2 061 

I M M II I I I I I II I I I I I M I I I I II I I I I I I I I I I M II II II I I I I I I M I I I I I I I 
CTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCAGAAGGATATT 1799 

TACTACTTTTG CATGAAAATAGAG CTTT CAAGTACATGGCTAGCTTTTATGGCAGTT CTG 2121 

MMMIMMMMMMMMMMMMMMMMMMMMMMIMM Ml 

TACTACTTTTG CATGAAAATAGAG CTTTCAAGTACATGG CTAGCTTTTATGGCAGTCCTG 1859 
GTGAATGTTCAATGGGAACTGGTCACCA.TGAAACTTTAGAGATTAACGACAAGATTTTCT 2181 

M II I II 1 1 1 1 I MMMM II. MMMI MMMM I MMM Ml 

GTGAATGTTGACTGGGAACTGGTCACCATGAAACTTTAGAGATTAACGACAAGATTTTCT 1919 
ACTTTTTTTAAGTGA-TTTTTTGTCCTTCAGCCAAACACAATATGGGCTCAG^ 224 0 

IMIIIIIIIIIIirilMIII MMMMMMMMMMIIIMM MMMM 

ACTTTTTTTAAGTGATTTTTTTGGCCTTCAGCO^CACAATATGGGCTCAAGTCACTTT 1979 
TATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAATAGCCTAACATGATTAT 2300 

'III MM III II'MII! II ; !l, Ml IM M II, II : il 

TATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAATAGCCTAACATGATTAT 2 03 9 
TTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAA 23 60 

I M 1 1 II M II 1 1 1 1 II I II M I II 1 1 II II II II II II 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 

TTGAACTTATTTAC^C^TAGTTTGAAAAAAAAAAGACAAAAATAGTATTC^GGTGAGC^ 2 099 
TTAGATTAGTATTTTC(^CGTCACTATTTATTTTTTTAAAAC^CAAATTCTAAAGCTACA 242 0 

M M M M M I M 1 1 M 1 1 1 1 II 1 1 IMIMIIIIIMIIIIIIIIIIIIIIIIIIIII 

TTAGATTAGTATTTTCCACGTCACTGTTTATTTTTTTAAAACACAAATTCTAAAGCTACA 215 9 



Qy 2421 ACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTTGGCAGTTTAATAGATGT 2480

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

Db 2160 ACAAATACTACAGGCCCTTAAA 2219

Qy 2481 TACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGGTGTTTTATTACAAGGG 2540

III MM MM MINIMI MINIM INI Mil Mil Mill IIIMIIIIIIIIII

Db 2220 TACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGGTGTTTTATTACAAGGG 2279



Qy 2541 ACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCAATCAGATAGTTCTTTT 2600 

IHI II i U III m ||: m || 'Mm M1 , || | 

Db 2280 ACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCAATCAGATAGTTCTTTT 2339

Qy 2601 TCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAAATCAATGTCAAGTACCA 2660

• III MM III I II I ! 1 1 ! : 1 1 I II MM Mill IIIMIIIIIIIIII

Db 2340 TCACAAGTTCAAT-CTGTTTTTCATGTAAATTTCGTATGAAAAATC^TGTCAAGTACCA 2398

Qy 2661 AAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCAGTGCACTGTATATAGAA 2720

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMII IIIMIIIIIIIIII

Db 2399 AAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCAGAGCACTGTATATAGAA 2458

Qy 2721 GTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATGATTCGGAAATTTTCATT 2780

M M Ml I II M I .11 M II Ml. Ml Mill

Db 2459 GTCTAAAACACACCTAAGAG-AAAAGATCGAATTTTTCAGATGATTCAGAAATTTTCATT 2517



Qy 2781 CAGGTATTTGTAATAGTGACATATATATGTATATACATATCACCTCCTATTCTCTTAATT 2840

1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 II 1 1 1 1 II II II 1 1 1 1 1 II 1 1 1 1 1 1 II II 1 1 II 1 II 1 1

Db 2518 CAGGTATTTGGAATAGTGACATATATATGTATATACATATCACCTCCTATTCTCTTAATT 2577

Qy 2841 TTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCCCTTTTCCATATAGGAA 2900

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 I M 1

Db 2578 TTTCTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCCCTTTTCCATATAGGAA 2637

Qy 2901 ACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAAAATAATTACCCACAAA 2960

1 1 1 M II 1 1 II II 1 1 II 1 II 1 1 II 1 1 II II 1 1 1 1 II 1 II 1 II 1 1 1 1 1 II 1 1 1 II 1 II II 1

Db 2638 ACATAATTTTGAAGTGGCCAGATGAGTTTATC^TGTC^GTGAAAAATAATTACCCACAAA 2697

Qy 2961 TGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCAGTATGAACCTAACTCCC 3020

Ml II Nil 1 1 MM Nil INI INI 1 INI II II INI INI INI MINN

Db 2698 TGCCACCAG--AACTTACGATTCTTCACTTCTTGGGGTTTTCAGTATGAACCTAACTCCC 2755

Qy 3021 CACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGCCCACAGTGACTTTTGCT 3080

II 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 II 1 1 1 II 1 1 1 1 II 1 1 1 1 II 1 1 1 1 II 1 II II 1 II 1 1 II 1 1 II

Db 2756 CACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGCCCACAGTGACTTTTGCT 2815

Qy 3081 GGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAATCTTTTACTAGTGTGT 3140

III INI lllll MINIMI IN NNINII INI II IIIIIMIIIIIIINII III

Db 2816 GGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAATCTTTTACTAGTGTGT 2875

Qy 3141 GTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTTTTCTAGACTGTCTCTG 3200

MMM Ml MM iMMi II, II 1. (MMMMI II, II 1

Db 2876 GTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTTTTCTAGACTGTCTCTG 2935

Qy 3201 TGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGTATGTATGGATTTAATCT 3260

MMII || IIIIIIIINIII II II MINIMI INI IMMI II. 1 II

Db 2936 TGGAATANATATGTGTGTGTGATATATGCATGTGTGTGATGATATGTATGGATTTAANCT 2995



Qy 3261 AATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCTGAGCTAAAATCTAGGT 3320

IIIIIIIIIMIIIIillMIIIIIIIIIIIIIMIIIIIIIIIIII I I I I I I I I I I I I

Db 2996 AATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCTGAG-TAAAATCTAGGT 3054

Qy 3321 GATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTAGCAACCTTCTGCATTC 3380

Ml I 1 1 1 1 1 1 1 II I ! 1 1 II I 1 1 llll 1 1 1 1 1 1 1 Ml II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M

Db 3055 GATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTAGCAACCTTCTGCATTC 3114

Qy 3381 ATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAGGCAGCGTGAAAGCAGA 3440

IMIIIIIIIIIIIIIIIIMIIIIIMIMIIIIIIIMIIIIIIIMIIIIIMIIII

Db 3115 ATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAGGCAGCGTGAAAGCAGA 3174

Qy 3441 TGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGTTTGATAAAGCAGTATT 3500

IIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIMIIIMIIMIMI

Db 3175 TGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGTTTGATAAAGCAGTATT 3234

Qy 3501 TGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACTTTGAAGTATTATATTG 3560

IMIIIIIIIIIMI IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMII

Db 3235 TGGGGTCATATTGTTNCCTGTGCTGGAGCAAAAGTCATTACACTTTGAAGTATTATATTG 3294

Qy 3561 TTCTTATCCT 3570

MINI III

Db 3295 TTCTTAGCCT 3304



RESULT 10 

BC022511 

LOCUS       BC022511     2705 bp    mRNA    linear   PRI 04-FEB-2002
DEFINITION  Homo sapiens, endothelin receptor type A, clone MGC:26548
            IMAGE:4812050, mRNA, complete cds.
ACCESSION   BC022511
VERSION     BC022511.1  GI:18490297
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2705)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-FEB-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 32 Row: j Column: 7
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4503464.
FEATURES             Location/Qualifiers
     source          1..2705
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="LocusID:1909"
                     /db_xref="taxon:9606"
                     /clone="MGC:26548 IMAGE:4812050"
                     /tissue_type="Brain, hippocampus"
                     /clone_lib="NIH_MGC_95"
                     /lab_host="DH10B"
                     /note="Vector: pBluescript"
     CDS             351..1634
                     /codon_start=1
                     /product="endothelin receptor type A"
                     /protein_id="AAH22511.1"
                     /db_xref="GI:18490298"
                     /translation="METLCLRASFWLALVGCVISDNPERYSTNLSNHVDDFTTFRGTE
                     LSFLVTTHQPTNLVLPSNGSMHNYCPQQTKITSAFKYINTVISCTIFIVGMVGNATLL
                     RIIYQNKCMRNGHNALIASLALGDLIYWIDLPINVFKLLAGRWPFDHNDFGVFLCKL
                     FPFLQKSSVGITVLNLCALSVDRYRAVASWSRVQGIGIPLVTAIEIVSIWILSFILAI
                     PEAIGFVMVPFEYRGEQHKTCMLNATSKFMEFYQDVKDWWLFGFYFCMPLVCTAIFYT
                     LMTCEMLNRRNGSLRIALSEHLKQRREVAKTVFCLWIFALCWFPVHLSRILKKTVYN
                     EMDKNRCELLSFLLLMDYIGINLATMNSCINPIALYFVSKKFKNCFQSCLCCCCYQSK
                     SLMTSVPMNGTSIQWKDHDQNNHNTDRSSHKDSMN"
BASE COUNT      767 a    594 c    560 g    784 t
ORIGIN



Query Match 63.6%; Score 2609.6; DB 9; Length 2705;

Best Local Similarity 99.3%; Pred. No. 0;

Matches 2673; Conservative 0; Mismatches 14; Indels 5; Gaps 5;



Qy 134 AGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATCCATCCCACC 193

I I I I I I I I I I II I I I I I I I li I I I I II I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I

Db 1 AGTCATCCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATCCATCCCACC 60

Qy 194 CGGTCGTCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAGTGCCCAGGA 253

II II MM MM II II II II II II IMI II I MMMMMMMMIMMMIMM

Db 61 CGGTCGTCGCCGGGGATTGGGGTCCCAGCGAGACCTCCCCGGGAGAAGCAGTGCCCAGGA 120

Qy 254 AGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGAGCCCGGGAC 313

MMMMMMMMMMMMMMMMMMMIMMMMMMMMIMM

Db 121 AGTTTTCTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGAGCCCGGGAC 180

Qy 314 ACCGGCCACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGCCCAGGCGCC 373

M I I II M II M II II I II II II I II II II I i I I I I 1 I I 1 I 1 f [ I I I I I I I I E I 1 I I

Db 181 ACCGGCCACCCTCCGCGCCACCCACCCTCGCCGGCTCCGGCTTCCTCTGGCCCAGGCGCC 240

Qy 374 GCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAAAGTGAAGGT 433

1 1 1 1 f 1 1 1 1 1 1 r 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 r f 1 1 1 1 r 1 1 1 1 f mmmmimimm

Db 241 GCGCGGACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTG-AAAAAAAAGTGAAGGT 299



Qy 434 GTAAAAGCAGC^C^GTGCAATAAGAGATATTTCCTCAAATTTGCCTa^GATGGAAACC 493

I II I MM Mill I MM I llll I MM Mill I MM I MM I MM I MM I II II 1 1

Db 300 GTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAGATGGAAACC 359

Qy 494 CTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGTGATAATCCT 553

IIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMMMIIIIIIII

Db 360 CTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGTGATAATCCT 419

Qy 554 GAGAGATACAGCACAAATCTAAGCAATC^VTGTGGATGATTTCACCACTTTTCGTGGCACA 613

MMMMMMMMMMMMMMMMMMMMMMMMMIMMIMM

Db 420 GAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTTCGTGGCACA 479

Qy 614 GAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCCAGCAATGGC 673

IMMIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIMIIIIIIIIM

Db 480 GAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCCAGCAATGGC 539

Qy 674 TCAATGCACAACTATTGCCC^CAGCAGACTAA 733

MINI II Ml III MM II! II llllll llll MINI llll 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1

Db 540 TCAATGCAC^CTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAAC 599

Qy 734 ACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGG 793

1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 M II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 II

Db 600 ACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGG 659

Qy 794 ATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCC 853

1: II ill IM M II II 1, II; 1! Ill, Ml :

Db 660 ATCATTTACCAGAACAAATGTATGAGGAATGGCCACAACGCGCTGATAGCCAGTCTTGCC 719

Qy 854 CTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCT 913

MIIIIMIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIMI

Db 720 CTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCT 779

Qy 914 GGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTT 973

IMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII

Db 780 GGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTT 839

Qy 974 TTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGG 1033

1 1 M II 1 M M 1 1 M 1 1 1 1 1 1 II II 1 1 II II 1 1 1 1 II 1 II II 1 II M II M M 1 II II 1 1

Db 840 TTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGG 899

Qy 1034 TACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCC 1093

MM llll II II II II II III Mill II Mill II MM II II MUM II III llllll

Db 900 TACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCC 959

Qy 1094 ATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGC 1153

MM llll III llllll II llll Ml III IIIIIIIIIIIIIMIIMIMIIIMIII

Db 960 ATTGAAATTGTCTCCATCTGGATCCTGTCGTTTATCCTGGCCATTCCTGAAGCGATTGGC 1019

Qy 1154 TTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC 1213

II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

Db 1020 TTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC 1079

Qy 1214 ACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTAT 1273

MM Mill II II III II Mill Mill MUM MM llllll Mill llll II

Db 1080 ACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTAT 1139


Qy 1274 TTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTG 1333

MMMIMIMIMIMMMMMMMIIMMMMMMMMMMIMIMM

Db 1140 TTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTG 1199

Qy 1334 AACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAA 1393

MIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII!

Db 1200 AACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAA 1259

Qy 1394 GTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCAC 1453

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIII 1 1 1 1

Db 1260 GTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTGTTCAT 1319

Qy 1454 TTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTA 1513

i iii ii ii: ;i M ii Mi ii ii iiin ii ii in

Db 1320 TTAAGCCGTATATTGAAGAAAACTGTGTATAACGAGATGGACAAGAACCGATGTGAATTA 1379

Qy 1514 CTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGT 1573

INI Mill Mil INI Mil Mill MINIM Mill 1 1 1! I II II llllllllllll

Db 1380 CTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGT 1439

Qy 1574 ATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGC 1633

IM Ml Ml Ml ill II Ml II II Ml MM II M II

Db 1440 ATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGC 1499

Qy 1634 CTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATGAACGGAACA 1693

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II I

Db 1500 CTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATGAACGGAACA 1559

Qy 1694 AGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACA 1753

IMIIIIIIIIIIM IIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII

Db 1560 AGCATCCAGTGGAAGGACCACGATCAAAACAACCACAACACA 1619

Qy 1754 GACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATCCTCTCGGAG 1813

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIMMIIIIIII

Db 1620 GACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATCCTCTCGGAG 1679

Qy 1814 AAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAAT 1873

M I M II M M 1 1 II 1 1 1 1 II 1 1 1 IMMMMMIMMMMMMMIMIIMM

Db 1680 AAAAAAATCACAAGGCAACTGTGAGTCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAAT 1739

Qy 1874 TCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAA-GGTAGACTGGTTTATCCA 1932

II M! IM II II MM I! IM MM II II II I II I III I MM

Db 1740 TCACTCCCACACCC^GAAGAAATGCTTTCOVAAACCGCAAGGGTAGACTGGTTTATCCA 1799

Qy 1933 CCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTTG 1992

I > 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

Db 1800 CCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTTG 1859

Qy 1993 TATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCAG 2052

II II I II 1 1 MM MM III I II II I II I II 1 1 INI II II II Mill II II MM I II I

Db 1860 TATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCAG 1919

Qy 2053 AAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAGCTTTTATG 2112

MM Mill MM MM I MM MM Mill llllllllllll I MM MM MM Ml I

Db 1920 AAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAGCTTTTATG 1979

Qy 2113 GCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGATTAACGACA 2172

M 'MM II Ml ! IMMM [| , M M IM II M MM I

Db 1980 GCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGATTAACGACA 2039



Qy 2173 AGATTTTCTACTTTTTTTAAGTGA-TTTTTTGTCCTTCAGCCAAACACAATATGGGCTCA 2231

IIIMMIIIIIMIIIillllll 1 1 1 1 . II 1 1 1 1 ! 1 1 1 1 ' . 1 1

Db 2040 AGATTTTCTACTTTTTTTAAGTGATTTTTTTGTCCTTCAGCCAAACACAATATGGGCTCA 2099

Qy 2232 GGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAATAGCCTAA 2291

MIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIMIIIIIIIIIIIIIMIMIIIII

Db 2100 AGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAATAGCCTAA 2159

Qy 2292 CATGATTATTTGAACTTATTTACACATAGTTTG-AAAAAAAAAAGACAAAAATAGTATTC 2350

1, II ,11 II ill M II ,1 lllllllllllillllllllllllll

Db 2160 CATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAAGACAAAAATAGTATTC 2219

Qy 2351 AGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACACAAATTC 2410

Ml MM II lllllllllllillllllllllllll 1. ,11 II II II 1

Db 2220 AGGTGAGCAATTAGATTAGTATTTTCCACGTCACTGTTTATTTTTTTAAAACACAAATTC 2279

Qy 2411 TAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTTGGCAGTT 2470

IlillllllMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIII

Db 2280 TAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTTGGCAGTT 2339

Qy 2471 TAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGGTGTTTT 2530

MMIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII

Db 2340 TAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATGGTGTTTT 2399

Qy 2531 ATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCAATCAGA 2590

1 M M 1 II 1 1 II II 1 1 II 1 II II 1 II II II 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 II II 1

Db 2400 ATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCAATCAGA 2459

Qy 2591 TAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAAATCAATG 2650

MIIIIIIIIIIIIIIIIIIIM IMIMIIIIIIIIMIIIIMIIIIIIIIIMIII

Db 2460 TAGTTCTTTTTCACAAGTTCAAT-CTGTTTTTCATGTAAATTTTGTATGAAAAATCAATG 2518

Qy 2651 TCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCAGTGCACT 2710

MMIMMMMIMMMMIIMMMMIMMMMMMMMMMMMMI

Db 2519 TCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCAGTGCACT 2578

Qy 2711 GTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATGATTCGGA 2770

MMIIMMIMMMIMIMMMMIMMMMIIIMIMMIMMMM II

Db 2579 GTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATGATTCAGA 2638

Qy 2771 AATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCA 2822

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 II 1 1 1 1

Db 2639 AATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCA 2690





RESULT 11 

BD169895 

LOCUS       BD169895     2595 bp    DNA     linear   PAT 17-JAN-2003
DEFINITION  Method for examination of allergosis.
ACCESSION   BD169895
VERSION     BD169895.1  GI:27875707
KEYWORDS    WO 02052006-A/28.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2595)
  AUTHORS   Otani,N., Matsui,K., Yoshida,N., Sugita,Y. and Izuhara,K.
  TITLE     Method for examination of allergosis
  JOURNAL   Patent: WO 02052006-A 28 04-JUL-2002;
            GENOX RESEARCH INC,NORIKO OTANI,KEIKO MATSUI,NEI YOSHIDA,YUJI
            SUGITA,KENJI IZUHARA
COMMENT     OS   Homo sapiens (human)
            PN   WO 02052006-A/28
            PD   04-JUL-2002
            PF   21-DEC-2001 WO 2001JP011287
            PR   26-DEC-2000 JP OOP 396166
            PI   NORIKO OTANI,KEIKO MATSUI,NEI YOSHIDA,YUJI SUGITA,KENJI
            PI   IZUHARA
            PC   C12N15/12,C12Q1/68,C12Q1/02,A01K67/027,A61K31/713,A61K45/00,
            PC   A61K48/00,
            PC   A61P37/08,G01N33/15,G01N33/50,G01N33/53
            CC   Method for examination of allergosis
            FH   Key             Location/Qualifiers
            FT   source          1..2595
            FT                   /organism='Homo sapiens (human)'.
FEATURES             Location/Qualifiers
     source          1..2595
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
BASE COUNT      788 a    478 c    459 g    870 t
ORIGIN



Query Match 60.2%; Score 2470; DB 6; Length 2595;

Best Local Similarity 100.0%; Pred. No. 0;

Matches 2481; Conservative 0; Mismatches 0; Indels 1; Gaps 1;



Qy 1624 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 1683

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 15 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 74

Qy 1684 GAACGGAACAAGCATCCAGTGGAAGAACCACGATC 1743

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 75 GAACGGAACAAGCATCCAGTGGAAGAACCACGATC 134

Qy 1744 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 1803

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 135 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 194

Qy 1804 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 1863

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 195 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 254

Qy 1864 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 1923

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 255 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 314

Qy 1924 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 1983

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 315 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 374

Qy 1984 TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 2043

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 375 TGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAA 434

Qy 2044 TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 2103

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 435 TGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTA 494

Qy 2104 GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 2163

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 495 GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 554

Qy 2164 TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 2223

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 555 TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 614

Qy 2224 TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 2283

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 615 TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 674

Qy 2284 TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 2343

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 675 TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 734

Qy 2344 AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 2403

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 735 AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 794

Qy 2404 CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 2463

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 795 CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 854

Qy 2464 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 2523

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 855 GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 914

Qy 2524 GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 2583

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 915 GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 974

Qy 2584 AATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAA 2643

|||||||||||||||||||||||||||||| |||||||||||||||||||||||||||||

Db 975 AATCAGATAGTTCTTTTTCACAAGTTCAAT-CTGTTTTTCATGTAAATTTTGTATGAAAA 1033

Qy 2644 ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 2703

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1034 ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 1093

Qy 2704 GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 2763

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1094 GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 1153

Qy 2764 ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 2823

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1154 ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 1213

Qy 2824 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 2883

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1214 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 1273



Qy 2884 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 2943

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1274 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 1333

Qy 2944 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 3003

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1334 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 1393

Qy 3004 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 3063

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1394 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 1453

Qy 3064 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 3123

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1454 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 1513

Qy 3124 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 3183

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1514 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 1573

Qy 3184 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 3243

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1574 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 1633

Qy 3244 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 3303

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1634 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 1693

Qy 3304 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 3363

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1694 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 1753

Qy 3364 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 3423

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1754 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 1813

Qy 3424 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 3483

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1814 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 1873

Qy 3484 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 3543

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1874 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 1933

Qy 3544 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 3603

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1934 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 1993

Qy 3604 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 3663

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1994 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 2053

Qy 3664 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 3723

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2054 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 2113



Qy 3724 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 3783

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2114 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 2173

Qy 3784 CATATGCAGTGATATATGCCTATAATATAAGGCATAGGTTCACACCATTTTGTTTAGACA 3843

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2174 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 2233

Qy 3844 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 3903

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2234 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 2293

Qy 3904 CAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAA 3963

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2294 CAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAA 2353

Qy 3964 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 4023

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2354 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 2413

Qy 4024 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4083

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2414 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 2473

Qy 4084 AATAAAAGTTTACAGAAACCTT 4105

||||||||||||||||||||||

Db 2474 AATAAAAGTTTACAGAAACCTT 2495
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FEATURES 
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CDS 



intron 
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Location/Qualifiers 
1..2595
/organism="Homo sapiens"
/mol_type="genomic DNA"
/db_xref="taxon:9606"
/chromosome="4"
/clone_lib="lambda EMBL3 and lambda EMBL3SP6/T7"
join(D11145.1:92..511,D11146.1:21..148,D11147.1:21..219,
D11148.1:21..173,D11149.1:21..154,D11150.1:21..129,
19..159)
/codon_start=1
/product="endothelin-A receptor"
/protein_id="BAA01920.1"
/db_xref="GI:219630"
/translation="METLCLRASFWLALVGCVISDNPERYSTNLSNHVDDFTTFRGTE
LSFLVTTHQPTNLVLPSNGSMHNYCPQQTKITSAFKYINTVISCTIFIVGMVGNATLL
RIIYQNKCMRNGPNALIASLALGDLIYWIDLPINVFKLLAGRWPFDHNDFGVFLCKL
FPFLQKSSVGITVLNLCALSVDRYRAVASWSRVQGIGIPLVTAIEIVSIWILSFILAI
PEAIGFVMVPFEYRGEQHKTCMLNATSKFMEFYQDVKDWWLFGFYFCMPLVCTAIFYT
LMTCEMLNRRNGSLRIALSEHLKQRREVAKTVFCLWIFALCWFPLHLSRILKKTVYN
EMDKNRCELLSFLLLMDYIGINLATMNSCINPIALYFVSKKFKNCFQSCLCCCCYQSK
SLMTSVPMNGTSIQWKNHDQNNHNTDRSSHKDSMN"
<1. .18 
/number=7 
19. .2495 
/number=8 
2474. .2479 
a 478 c 459 g 870 t 



Query Match 60.2%; Score 2470; DB 9; Length 2595; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2481; Conservative 0; Mismatches 0; Indels 1; Gaps 1;



Qy 1624 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 1683

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 15 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 74

Qy 1684 GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCA 1743

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 75 GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACA 134

Qy 1744 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 1803

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 135 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 194

Qy 1804 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 1863

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 195 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 254

Qy 1864 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 1923

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 255 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTG 314

Qy 1924 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 1983

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



Db 


315 


Qy 


1984 


Db 


375 


Qy 


2044 


Db 


435 


Qy 


2104 


Db 


495 


Qy 


2164 


Db 


555 


Qy 


2224 


Db 


615 


Qy 


2284 


Db 


675 


Qy 


2344 


Db 


735 


Qy 


2404 


Db 


795 


Qy 


2464 


Db 


855 


Qy 


2524 


Db 


915 


Qy 


2584 


Db 


975 


Qy 


2644 


Db 


1034 




A / U *± 


Db 


1094 


Qy 


2764 


Db 


1154 



315 GTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTC 374 
TG CGTGTTGTATTCAG CACTAAAAAATGGTGGGAG CTGGGGGAGAATGAAGACTGTTAAA 2 043 

IMIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 

TG CGTGTTGTATTCAG CACTAAAAAATGGTGGGAG CTGGGGGAGAATGAAGACTGTTAAA 434 
TGAAACCAGAAGGATATTTACTACTTTTG CATGAAAATAGAG CTTTCAAGTACATGG CTA 2103 

MM Mill II II I III MM Mill HIM III I Mil II II INI I MM INI MM 

TGAAAC CAGAAGGATATTTACTACTTTTG CATGAAAATAGAG CTTTCAAGTACATGGCTA 4 94 
GCTTTTATGGQVGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 2163 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIII 

GCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGA 554 
TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAACACAATA 2223 

M 1 1 II I II 1 1 1 II I II 1 1 II 1 1 II II II 1 1 1 1 II II 1 1 1 1 1 II 1 1 1 II 1 1 M 1 1 1 1 II I 

TTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTGTCCTTCAGCCAAAC^CAATA 614 
TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 22 83 

IIMIIIIIIIIMIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIII 

TGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATAA 674 
TAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAAAT 2343 

M 1 1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 II I II 1 1 1 II II 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 

TAGCCTAACATGATTATTTGAACTTATTTAC^CATAGTTTGAAAAAAAAAAGACAAAAAT 734 
AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 24 03 

MIMMMMMIMMMIMIMIMMMMMMMIMMMIMMMIMM 

AGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAACA 794 
CAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACATTT 2463 

II Ml II II II INI II II I II MM II Ml 1 1 II II I! MM! 1 1 MM MINI MM 

CAAATTCTAAAGCTACAACAAATACTACAGGCCCT^ 854 
GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 2523 

II I II 1 1 1 M 1 1 II 1 1 1 1 II I II 1 1 1 1 1 1 1 M I II 1 1 1 1 1 1 1 1 II I II 1 1 1 II I II II 1 1 

GGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAATG 914 
GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 2583 

1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 1 1 M I 

GTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTTC 974 
AATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAA 2643 

MIMIIIIIIMIIIIIIIIIIillllll Ml II II II II MM | 

AATCAGATAGTTCTTTTTCACAAGTTCAAT - CTGTTTTTCATGTAAATTTTGTATGAAAA 1033 
ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 2703 

I M II 1 1 1 1 1 1 1 II 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 ll || 1 1 1| 1 1 1 1 1 1| 1 1 1 - 

ATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTCA 10 93 
GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 2763 

!M Ml Ml M 'M II II II MMMM M.MI II M 

GTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGATG 1153 
ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 2823 

M M I M M II 1 1 M 1 1 1 M I II I II II 1 1 1 II II M II 1 1 1 1 1 1 1 1 M II I II II 1 1 II 

ATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATCAC 1213 



Qy  2824 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 2883
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1214 CTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCATTC 1273

Qy  2884 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 2943
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1274 CCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAA 1333

Qy  2944 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 3003
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1334 AAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCA 1393

Qy  3004 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 3063
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1394 GTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGGGC 1453

Qy  3064 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 3123
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1454 CCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAA 1513

Qy  3124 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 3183
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1514 ATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATT 1573

Qy  3184 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 3243
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1574 TTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGT 1633

Qy  3244 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 3303
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1634 ATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTC 1693

Qy  3304 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 3363
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1694 TGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGT 1753

Qy  3364 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 3423
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1754 AGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGA 1813

Qy  3424 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 3483
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1814 GGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGG 1873

Qy  3484 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 3543
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1874 TTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACAC 1933

Qy  3544 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 3603
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1934 TTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGT 1993

Qy  3604 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 3663
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1994 CTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATT 2053



Qy  3664 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 3723
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2054 TGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAA 2113

Qy  3724 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 3783
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2114 GGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCAT 2173

Qy  3784 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 3843
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2174 CATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACA 2233

Qy  3844 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 3903
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2234 ATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATT 2293

Qy  3904 CAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAA 3963
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2294 CAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAA 2353

Qy  3964 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 4023
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2354 CTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGT 2413

Qy  4024 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 4083
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2414 AACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAA 2473

Qy  4084 AATAAAAGTTTACAGAAACCTT 4105
         ||||||||||||||||||||||
Db  2474 AATAAAAGTTTACAGAAACCTT 2495
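
The Matches / Mismatches / Indels / Gaps figures reported for each result can be re-derived from the aligned Qy and Db rows. The following is a minimal sketch, not the search program's own code: the function name `tally_alignment` and the counting convention (one "gap" per unbroken run of `-` characters in the same row, one "indel" per gapped column) are assumptions chosen because they agree with the totals printed in this listing. The sample rows are the Qy 2584 / Db 975 block of the alignment above, which carries the single indel of this result.

```python
from typing import NamedTuple


class AlignmentTally(NamedTuple):
    matches: int
    mismatches: int
    indels: int
    gaps: int


def tally_alignment(qy: str, db: str, gap_char: str = "-") -> AlignmentTally:
    """Count column types in two equal-length aligned rows (hypothetical helper)."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have the same length")
    matches = mismatches = indels = gaps = 0
    open_gap_row = None  # row ("qy" or "db") holding the gap run in progress
    for q, d in zip(qy.upper(), db.upper()):
        if q == gap_char or d == gap_char:
            row = "qy" if q == gap_char else "db"
            indels += 1
            if row != open_gap_row:  # a new run of gap characters starts here
                gaps += 1
            open_gap_row = row
        else:
            open_gap_row = None
            if q == d:
                matches += 1
            else:
                mismatches += 1
    return AlignmentTally(matches, mismatches, indels, gaps)


# Aligned rows copied from the Qy 2584 / Db 975 block
qy = "AATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAAA"
db = "AATCAGATAGTTCTTTTTCACAAGTTCAAT-CTGTTTTTCATGTAAATTTTGTATGAAAA"
print(tally_alignment(qy, db))
```

Summing such per-block tallies over a whole alignment should reproduce the totals on the "Matches ...; Gaps ...;" line, provided a gap run that crosses a block boundary is counted once.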

RESULT 13 
AC093908 

LOCUS       AC093908 164920 bp DNA linear PRI 01-MAR-2002
DEFINITION  Homo sapiens BAC clone RP11-752L20 from 4, complete sequence.
ACCESSION   AC093908 AC067873
VERSION     AC093908.3 GI:18497272
KEYWORDS    HTG.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 164920)
  AUTHORS   Sulston,J.E. and Waterston,R.
  TITLE     Toward a complete human genome sequence
  JOURNAL   Genome Res. 8 (11), 1097-1108 (1998)
  MEDLINE   99063792
  PUBMED    9847074
REFERENCE   2 (bases 1 to 164920)
  AUTHORS   Paulson,E., Cotton,M. and Creason,K.
  TITLE     The sequence of Homo sapiens BAC clone RP11-752L20
  JOURNAL   Unpublished (2001)
REFERENCE   3 (bases 1 to 164920)
  AUTHORS   Waterston,R.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-SEP-2001) Genome Sequencing Center, Washington
            University School of Medicine, 4444 Forest Park Parkway, St. Louis,
            MO 63108, USA
REFERENCE   4 (bases 1 to 164920)
  AUTHORS   Waterston,R.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2002) Genome Sequencing Center, Washington
            University School of Medicine, 4444 Forest Park Parkway, St. Louis,
            MO 63108, USA
REFERENCE   5 (bases 1 to 164920)
  AUTHORS   Waterston,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-FEB-2002) Department of Genetics, Washington
            University, 4444 Forest Park Avenue, St. Louis,
            Missouri 63108, USA
REFERENCE   6 (bases 1 to 164920)
  AUTHORS   Waterston,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-MAR-2002) Department of Genetics, Washington
            University, 4444 Forest Park Avenue, St. Louis,
            Missouri 63108, USA
COMMENT     On Feb 5, 2002 this sequence version replaced gi:15778805.
            Genome Center
            Center: Washington University Genome Sequencing Center
            Center code: WUGSC
            Web site: http://genome.wustl.edu/gsc
            Contact: sapiens@watson.wustl.edu

            Summary Statistics
            Center project name: H_NH0752L20
            Drafting Center: WIBR

            NOTICE: This sequence may not represent the entire insert of this
            clone. It may be shorter because we only sequence overlapping
            clone sections once, or longer because we provide a small overlap
            between neighboring data submissions.

            This sequence was finished as follows unless otherwise noted:
            all regions were double stranded, sequenced with an alternate
            chemistry, or covered by high quality data (i.e., phred quality >=
            30); an attempt was made to resolve all sequencing problems, such
            as compressions and repeats; all regions were covered by sequence
            from more than one subclone; and the assembly was confirmed by
            restriction digest.

            MAPPING INFORMATION:
            Mapping information for this clone was provided by Dr. John D.
            McPherson, Department of Genetics, Washington University, St. Louis
            MO. For additional information about the map position of this
            sequence, see http://genome.wustl.edu/gsc

            SOURCE INFORMATION:
            The RPCI-11 human BAC library was made from the blood of one male
            donor, as described by Osoegawa,K., Woon,P.Y., Zhao,B., Frengen,E.,
            Tateno,M., Catanese,J.J. and de Jong,P.J. (1998) An improved
            approach for construction of bacterial artificial chromosome
            libraries. Genomics 51:1-8. The clone may be obtained either from
            Research Genetics, Inc. (http://www.resgen.com) or Pieter de Jong
            and coworkers at http://www.chori.org

            VECTOR: pBACe3.6

            NEIGHBORING SEQUENCE INFORMATION:
            The clone sequenced to the right is RP11-56F3. Actual start of
            this clone is at base position 1 of RP11-752L20; actual end is at
            base position 164920 of RP11-752L20.

            Data from AC083898 was used to finish this clone, AC093908.

            The sequence of AC067873 has been incorporated into AC093908.
FEATURES             Location/Qualifiers
     source          1..164920
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /chromosome="4"
                     /map="4"
                     /clone="RP11-752L20"
                     /clone_lib="RPCI-11"
     repeat_region   3..112
                     /rpt_family="Alu"
     repeat_region   323..393
                     /rpt_family="L1"
     repeat_region   4328..4359
                     /rpt_family="AT_rich"
     repeat_region   4786..4811
                     /rpt_family="(CA)n"
     repeat_region   4813..4874
                     /rpt_family="(TC)n"
     repeat_region   4883..4910
                     /rpt_family="(CA)n"
     repeat_region   4948..5511
                     /rpt_family="L1"
     repeat_region   5058..5099
                     /rpt_family="AT_rich"
     repeat_region   5129..5164
                     /rpt_family="AT_rich"
     misc_feature    5149..5154
                     /note="similar to Homo sapiens EST AW150086 (NID:g6198078)
                     xg36f05.x1"
     repeat_region   5837..5927
                     /rpt_family="MER1_type"
     repeat_region   5930..6038
                     /rpt_family="L2"
     repeat_region   6126..6196
                     /rpt_family="MER1_type"
     repeat_region   6261..7668
                     /rpt_family="L2"
     repeat_region   6671..6698
                     /rpt_family="(TC)n"
     misc_feature    7297..7299
                     /note="similar to Homo sapiens EST BE857602
                     (NID:g10371791)"
     repeat_region   7694..8060
                     /rpt_family="ERVL"
     repeat_region   8064..8089
                     /rpt_family="(T)n"
     misc_feature    8077..8089
                     /note="similar to Homo sapiens EST BG284616
                     (NID:g13035741)"
     misc_feature    8077..8087
                     /note="similar to Homo sapiens EST BE139388 (NID:g8601888)
                     xr69h04.x1"
     repeat_region   8274..8724
                     /rpt_family="L2"
     repeat_region   8848..9191
                     /rpt_family="L1"
     repeat_region   9177..9211
                     /rpt_family="AT_rich"
     misc_feature    9209..9211
                     /note="similar to Homo sapiens EST AA609692 (NID:g2458120)
                     af16f01.s1"
     misc_feature    9221..9903
                     /note="similar to Sus scrofa EST BI401419 (NID:g15180480)"
     misc_feature    9221..9526
                     /note="similar to Homo sapiens EST R40029 (NID:g820778)
                     yf68e04.s1"
     misc_feature    9221..9403
                     /note="similar to Homo sapiens EST AI933963 (NID:g5672833)
                     wn95b05.x1"
     misc_feature    9222..9438
                     /note="similar to Homo sapiens EST BE139388 (NID:g8601888)
                     xr69h04.x1"
     misc_feature    9226..9859
                     /note="similar to Homo sapiens EST BI113803
                     (NID:g14564704)"
     misc_feature    9234..9822
                     /note="similar to Homo sapiens EST BE857602
                     (NID:g10371791)"
     misc_feature    9234..9679
                     /note="similar to Homo sapiens EST W67762 (NID:g1376634)
                     zd37f11.r1"
     misc_feature    9234..9620
                     /note="similar to Homo sapiens EST AI342876 (NID:g4080082)
                     qo32a04.x1"
     misc_feature    9234..9436
                     /note="similar to Homo sapiens EST AA609692 (NID:g2458120)
                     af16f01.s1"
     misc_feature    9234..9436
                     /note="similar to Homo sapiens EST AI969686 (NID:g5766504)
                     wz70d08.x1"
     misc_feature    9234..9436
                     /note="similar to Homo sapiens EST AW150086 (NID:g6198078)
                     xg36f05.x1"
     misc_feature    9253..9722
                     /note="similar to Bos taurus EST BI536408 (NID:g15377518)"
     misc_feature    9295..10091
                     /note="similar to Homo sapiens EST BG284616
                     (NID:g13035741)"
     misc_feature    9300..9802
                     /note="similar to Homo sapiens EST BF569074
                     (NID:g11642454)"
     misc_feature    9301..10099
                     /note="similar to Homo sapiens EST BG492199
                     (NID:g13453711)"
     misc_feature    9302..10048
                     /note="similar to Homo sapiens EST BG683630
                     (NID:g13915027)"
     misc_feature    9423..10120
                     /note="similar to Homo sapiens EST BG824910
                     (NID:g14172497)"

Query Match 58.7%; Score 2409.2; DB 9; Length 164920;
Best Local Similarity 99.4%; Pred. No. 0;
Matches 2470; Conservative 0; Mismatches 8; Indels 6; Gaps 5;

Qy  1624 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 1683
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 93358 CCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCAT 93417

Qy  1684 GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAG 1743
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 93418 GAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAG 93477

Qy  1744 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 1803
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 93478 CAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAAT 93537

Qy  1804 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATCCTT 1863
         |||||||||||||||||||||||||||||||||| |||||||||||||||||||||||||
Db 93538 CCTCTCGGAGAAAAAAATCACAAGGCAACTGTGAGTCCGGGAATCTCTTCTCTGATCCTT 93597

Qy  1864 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAA-GGTAGACT 1922
         ||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||||
Db 93598 CTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGGTAGACT 93657

Qy  1923 GGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATT 1982
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 93658 GGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATT 93717

Qy  1983 CTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAA 2042
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 93718 CTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAA 93777

Qy  2043 ATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCT 2102
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 93778 ATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCT 93837

Qy  2103 AGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAG 2162
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 93838 AGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAG 93897

Qy  2163 ATTAACGACAAGATTTTCTACTTTTTTTAAGTGA-TTTTTTGTCCTTCAGCCAAACACAA 2221
         |||||||||||||||||||||||||||||||||| |||||||||||||||||||||||||
Db 93898 ATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTTGTCCTTCAGCCAAACACAA 93957

Qy  2222 TATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCAT 2281
         |||||||||| |||||||||||||||||||||||||||||||||||||||||||||||||
Db 93958 TATGGGCTCAAGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCAT 94017

Qy  2282 AATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAA 2341
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94018 AATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAAA 94077

Qy  2342 ATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAA 2401
         |||||||||||||||||||||||||||||||||||||||||||| |||||||||||||||
Db 94078 ATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTGTTTATTTTTTTAAAA 94137

Qy  2402 CACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACAT 2461
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94138 CACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGACACAT 94197

Qy  2462 TTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAA 2521
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94198 TTGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAA 94257

Qy  2522 TGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCT 2581
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94258 TGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCT 94317

Qy  2582 TCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAA 2641
         |||||||||||||||||||||||||||||||| |||||||||||||||||||||||||||
Db 94318 TCAATCAGATAGTTCTTTTTCACAAGTTCAAT-CTGTTTTTCATGTAAATTTTGTATGAA 94376

Qy  2642 AAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTT 2701
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94377 AAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTT 94436

Qy  2702 CAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGA 2761
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94437 CAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGA 94496

Qy  2762 TGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATC 2821
         |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94497 TGATTCAGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGTATATACATATC 94556

Qy  2822 ACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCAT 2881
         |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 94557 ACCTCCTATTCTCTTAATTTTTCTTAAAATGTTAACTGGCAGTAAGTCTTTTTTGATCAT 94616

Qy  2882 TCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTG 2941
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94617 TCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAGATGAGTTTATCATGTCAGTG 94676

Qy  2942 AAAAATAATTACCCACAAATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTT 3001
         ||||||||||||||||||||||||||||  |  | |||||||||||||||||||||||||
Db 94677 AAAAATAATTACCCACAAATGCCACCAG--AACTTACGATTCTTCACTTCTTGGGGTTTT 94734

Qy  3002 CAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGG 3061
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94735 CAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCACATTGTCACCATTTCAAAGG 94794

Qy  3062 GCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGA 3121
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94795 GCCCACAGTGACTTTTGCTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGA 94854

Qy  3122 AAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCA 3181
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94855 AAATCTTTTACTAGTGTGTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCA 94914

Qy  3182 TTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATG 3241
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94915 TTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATG 94974

Qy  3242 GTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAG 3301
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94975 GTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAG 95034

Qy  3302 TCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCT 3361
         |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95035 TCTGAG-TAAAATCTAGGTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCT 95093

Qy  3362 GTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAA 3421
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95094 GTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAA 95153

Qy  3422 GAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTT 3481
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95154 GAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTT 95213

Qy  3482 GGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTAC 3541
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95214 GGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTAC 95273

Qy  3542 ACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTT 3601
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95274 ACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTT 95333

Qy  3602 GTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAA 3661
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95334 GTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAA 95393

Qy  3662 TTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGC 3721
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95394 TTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGC 95453

Qy  3722 AAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGC 3781
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95454 AAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGC 95513

Qy  3782 ATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGA 3841
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95514 ATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGA 95573

Qy  3842 CAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAA 3901
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95574 CAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAA 95633

Qy  3902 TTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGT 3961
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95634 TTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGT 95693

Qy  3962 AACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAA 4021
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95694 AACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAA 95753

Qy  4022 GTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGT 4081
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95754 GTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGT 95813

Qy  4082 AAAATAAAAGTTTACAGAAACCTT 4105
         ||||||||||||||||||||||||
Db 95814 AAAATAAAAGTTTACAGAAACCTT 95837
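
The two percentages on each result's summary lines can be checked against the raw counts. The sketch below is a hedged reading of the report, not documentation of the search program: it assumes that "Query Match" is the score divided by the perfect score of 4105 given in the report header, and that "Best Local Similarity" is the number of matches divided by all alignment columns (matches + mismatches + indels). Both assumptions reproduce the printed values for Results 12, 13 and 14; the function names are hypothetical.

```python
PERFECT_SCORE = 4105  # "Perfect score" from the report header


def percent(numerator: float, denominator: float) -> float:
    """Percentage rounded to one decimal place, as printed in the report."""
    return round(100.0 * numerator / denominator, 1)


def query_match(score: float, perfect_score: float = PERFECT_SCORE) -> float:
    # Assumed definition: alignment score relative to the query's self-score.
    return percent(score, perfect_score)


def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    # Assumed definition: identical columns over all aligned columns.
    return percent(matches, matches + mismatches + indels)


# Result 13 (AC093908): Score 2409.2; Matches 2470; Mismatches 8; Indels 6
print(query_match(2409.2))                 # compare with "Query Match 58.7%"
print(best_local_similarity(2470, 8, 6))   # compare with "Best Local Similarity 99.4%"
```

The same two calls with the Result 14 figures (Score 2292; Matches 2292; Mismatches 45; Indels 0) give values to compare with its "55.8%" and "98.1%" entries.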



RESULT 14 

G06463 

LOCUS       G06463 2337 bp DNA linear STS 19-OCT-1995
DEFINITION  human STS WI-7226, sequence tagged site.
ACCESSION   G06463
VERSION     G06463.1 GI:859708
KEYWORDS    STS; STS sequence; primer; sequence tagged site.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 2337)
  AUTHORS   Hudson,T.
  TITLE     Whitehead Institute/MIT Center for Genome Research; Physically
            Mapped ESTs
  JOURNAL   Unpublished (1995)
COMMENT     Contact: Thomas Hudson
            Whitehead Institute/MIT Center for Genome Research
            Whitehead Institute for Biomedical Research
            9 Cambridge Center, Cambridge MA 02142 USA
            Tel: 617 252 1900
            Fax: 617 252 1902
            Email: thudson@genome.wi.mit.edu

            Primer A: GATCGAATTTTTCAGATGATTCG
            Primer B: AAATGCCCAGCAAAAGTCAC
            STS size: 343
            PCR Profile:
              Presoak:
              Denaturation:
              Annealing: 56 degrees C
              Polymerization:
              PCR Cycles: 35
              Thermal Cycler:
            Protocol:
              Template: 10 ng
              Primer: each 5 pM
              dNTPs: each 4 nM
              Taq Polymerase: 0.025 units/ul
              Total Vol: 20 ul
            Buffer:
              MgCl2: 1.5 mM
              KCl: 50 mM
              Tris-HCL: 10 mM
              pH: 9.3

            Prepared with primer pairs derived from D90348 Unigene.
FEATURES             Location/Qualifiers
     source          1..2337
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /map="713_C_7; 722_B_3; 744_F_5; 933_G_4; 950_A_4;
                     827_H_10"
     STS             978..1320
     primer_bind     978..1000
     primer_bind     complement(1301..1320)

BASE COUNT 694 a 407 c 407 g 784 t 45 others 

ORIGIN 

Query Match 55.8%; Score 2292; DB 11; Length 2337; 

Best Local Similarity 98.1%; Pred. No. 0; 

Matches 2292; Conservative 0; Mismatches 45; Indels 0; Gaps 0; 

Qy  1769 CCACCCTTAGAAGCACTCCTCGGTACTCCCATAATCCTCTCGGAGAAAAAAATCACAAGG 1828
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     1 CCACCCTTAGAAGCACTCCTCGGTACTCCCATAATCCTCTCGGAGAAAAAAATCACAAGG 60

Qy  1829 CAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAATTCACTCCCACACCCA 1888
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    61 CAACTGTGACTCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAATTCACTCCCACACCCA 120

Qy  1889 AGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTGGTTTATCCACCCACAACATCTACGA 1948
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   121 AGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTGGTTTATCCACCCACAACATCTACGA 180

Qy  1949 ATCGTACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTTGTATTCAGCACTAAAAA 2008
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   181 ATCGTACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTTGTATTCAGCACTAAAAA 240

Qy  2009 ATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCAGAAGGATATTTACTACT 2068
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   241 ATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCAGAAGGATATTTACTACT 300

Qy  2069 TTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAGCTTTTATGGCAGTTCTGGTGAATG 2128
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   301 TTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAGCTTTTATGGCAGTTCTGGTGAATG 360

Qy  2129 TTCAATGGGAACTGGTCACCATGAAACTTTAGAGATTAACGACAAGATTTTCTACTTTTT 2188
         |||||||||||||||||||||||||||||||||||||||||||||||
Db   361 TTCAATGGGAACTGGTCACCATGAAACTTTAGAGATTAACGACAAGANNNNNNNNNNNNN 420

Qy  2189 TTAAGTGATTTTTTGTCCTTCAGCCAAACACAATATGGGCTCAGGTCACTTTTATTTGAA 2248
                       ||||||||||||||||||||||||||||||||||||||||||||||
Db   421 NNNNNNNNNNNNNNGTCCTTCAGCCAAACACAATATGGGCTCAGGTCACTTTTATTTGAA 480

Qy  2249 ATGTCATTTGGTGCCAGTATTTTTTAACTGCATAATAGCCTAACATGATTATTTGAACTT 2308
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   481 ATGTCATTTGGTGCCAGTATTTTTTAACTGCATAATAGCCTAACATGATTATTTGAACTT 540

Qy  2309 ATTTACACATAGTTTGAAAAAAAAAAGACAAAAATAGTATTCAGGTGAGCAATTAGATTA 2368
         ||||||||||||||||                  ||||||||||||||||||||||||||
Db   541 ATTTACACATAGTTTGNNNNNNNNNNNNNNNNNNTAGTATTCAGGTGAGCAATTAGATTA 600

Qy  2369 GTATTTTCCACGTCACTATTTATTTTTTTAAAACACAAATTCTAAAGCTACAACAAATAC 2428
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   601 GTATTTTCCACGTCACTATTTATTTTTTTAAAACACAAATTCTAAAGCTACAACAAATAC 660

Qy  2429 TACAGGCCCTTAAAGCACAGTCTGATGACACATTTGGCAGTTTAATAGATGTTACTCAAA 2488
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   661 TACAGGCCCTTAAAGCACAGTCTGATGACACATTTGGCAGTTTAATAGATGTTACTCAAA 720

Qy  2489 GAATTTTTTAAGAACTGTATTTTATTTTTTAAATGGTGTTTTATTACAAGGGACCTTGAA 2548
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   721 GAATTTTTTAAGAACTGTATTTTATTTTTTAAATGGTGTTTTATTACAAGGGACCTTGAA 780

Qy  2549 CATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCAATCAGATAGTTCTTTTTCACAAGT 2608
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   781 CATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCAATCAGATAGTTCTTTTTCACAAGT 840

Qy  2609 TCAATACTGTTTTTCATGTAAATTTTGTATGAAAAATCAATGTCAAGTACCAAAATGTTA 2668
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   841 TCAATACTGTTTTTCATGTAAATTTTGTATGAAAAATCAATGTCAAGTACCAAAATGTTA 900

Qy  2669 ATGTATGTGTCATTTAACTCTGCCTGAGACTTTCAGTGCACTGTATATAGAAGTCTAAAA 2728
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   901 ATGTATGTGTCATTTAACTCTGCCTGAGACTTTCAGTGCACTGTATATAGAAGTCTAAAA 960

Qy  2729 CACACCTAAGAGAAAAAGATCGAATTTTTCAGATGATTCGGAAATTTTCATTCAGGTATT 2788
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   961 CACACCTAAGAGAAAAAGATCGAATTTTTCAGATGATTCGGAAATTTTCATTCAGGTATT 1020

Qy  2789 TGTAATAGTGACATATATATGTATATACATATCACCTCCTATTCTCTTAATTTTTGTTAA 2848
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1021 TGTAATAGTGACATATATATGTATATACATATCACCTCCTATTCTCTTAATTTTTGTTAA 1080

Qy  2849 AATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCCCTTTTCCATATAGGAAACATAATT 2908
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1081 AATGTTAACTGGCAGTAAGTCTTTTTTGATCATTCCCTTTTCCATATAGGAAACATAATT 1140

Qy  2909 TTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAAAATAATTACCCACAAATGCCACCA 2968
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1141 TTGAAGTGGCCAGATGAGTTTATCATGTCAGTGAAAAATAATTACCCACAAATGCCACCA 1200

Qy  2969 GTAACTTAACGATTCTTCACTTCTTGGGGTTTTCAGTATGAACCTAACTCCCCACCCCAA 3028
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1201 GTAACTTAACGATTCTTCACTTCTTGGGGTTTTCAGTATGAACCTAACTCCCCACCCCAA 1260

Qy  3029 CATCTCCCTCCCACATTGTCACCATTTCAAAGGGCCCACAGTGACTTTTGCTGGGCATTT 3088
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1261 CATCTCCCTCCCACATTGTCACCATTTCAAAGGGCCCACAGTGACTTTTGCTGGGCATTT 1320

Qy  3089 TCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAATCTTTTACTAGTGTGTGTGTGTAT 3148
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1321 TCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAATCTTTTACTAGTGTGTGTGTGTAT 1380

Qy  3149 ATATATAAACAATTGTAAATTTCTTTTAGCCCATTTTTCTAGACTGTCTCTGTGGAATAT 3208
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1381 ATATATAAACAATTGTAAATTTCTTTTAGCCCATTTTTCTAGACTGTCTCTGTGGAATAT 1440



Qy   3209 ATTTGTGTGTGTGATATATGCATGTGTGTGATGGTATGTATGGATTTAATCTAATCTAAT 3268
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 ATTTGTGTGTGTGATATATGCATGTGTGTGATGGTATGTATGGATTTAATCTAATCTAAT 1500

Qy   3269 AATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCTGAGCTAAAATCTAGGTGATTGTTC 3328
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 AATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCTGAGCTAAAATCTAGGTGATTGTTC 1560

Qy   3329 ATCATGACAACCTGCCTCAGTCCATTTTAACCTGTAGCAACCTTCTGCATTCATAAATCT 3388
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 ATCATGACAACCTGCCTCAGTCCATTTTAACCTGTAGCAACCTTCTGCATTCATAAATCT 1620

Qy   3389 TGTAATCATGTTACCATTACAAATGGGATATAAGAGGCAGCGTGAAAGCAGATGAGCTGT 3448
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 TGTAATCATGTTACCATTACAAATGGGATATAAGAGGCAGCGTGAAAGCAGATGAGCTGT 1680

Qy   3449 GGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGTTTGATAAAGCAGTATTTGGGGTCA 3508
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGTTTGATAAAGCAGTATTTGGGGTCA 1740

Qy   3509 TATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACTTTGAAGTATTATATTGTTCTTATC 3568
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 TATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACTTTGAAGTATTATATTGTTCTTATC 1800

Qy   3569 CTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTCTGATATTTCTTTCAGACTTCGCCA 3628
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1801 CTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTCTGATATTTCTTTCAGACTTCGCCA 1860

Qy   3629 GACAGATTGCTGATAATAAATTAGGTAAGATAATTTGTTGGGCCATATTTTAGGACAGGT 3688
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1861 GACAGATTGCTGATAATAAATTAGGTAAGATAATTTGTTGGGCCATATTTTAGGACAGGT 1920

Qy   3689 AAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAGGCTAAGAAGTACTGCCCTTTTGTG 3748
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1921 AAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAGGCTAAGAAGTACTGCCCTTTTGTG 1980

Qy   3749 TGTTAGCAGTCAAATCTATTATTCCACTGGCGCATCATATGCAGTGATATATGCCTATAA 3808
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1981 TGTTAGCAGTCAAATCTATTATTCCACTGGCGCATCATATGCAGTGATATATGCCTATAA 2040

Qy   3809 TATAAGCCATAGGTTCACACCATTTTGTTTAGACAATTGTCTTTTTTTCAAGATGCTTTG 3868
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2041 TATAAGCCATAGGTTCACACCATTTTGTTTAGACAATTGTCTTTTTTTCAAGATGCTTTG 2100

Qy   3869 TTTCTTTCATATGAAAAAAATGCATTTTATAAATTCAGAAAGTCATAGATTTCTGAAGGC 3928
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2101 TTTCTTTCATATGAAAAAAATGCATTTTATAAATTCAGAAAGTCATAGATTTCTGAAGGC 2160

Qy   3929 GTCAACGTGCATTTTATTTATGGACTGGTAAGTAACTGTGGTTTACTAGCAGGAATATTT 3988
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2161 GTCAACGTGCATTTTATTTATGGACTGGTAAGTAACTGTGGTTTACTAGCAGGAATATTT 2220

Qy   3989 CCAATTTCTACCTTTACTACATCTTTTCAACAAGTAACTTTGTAGAAATGAGCCAGAAGC 4048
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2221 CCAATTTCTACCTTTACTACATCTTTTCAACAAGTAACTTTGTAGAAATGAGCCAGAAGC 2280

Qy   4049 CAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAAAATAAAAGTTTACAGAAACCTT 4105
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2281 CAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAAAATAAAAGTTTACAGAAACCTT 2337



RESULT 15
S63938
LOCUS       S63938                  1868 bp    mRNA    linear   PRI 07-MAY-1993
DEFINITION  A-type endothelin receptor [human, placenta, mRNA, 1868 nt].
ACCESSION   S63938
VERSION     S63938.1  GI:238635
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1868)
  AUTHORS   Adachi,M., Yang,Y.Y., Furuichi,Y. and Miyamoto,C.
  TITLE     Cloning and characterization of cDNA encoding human A-type
            endothelin receptor
  JOURNAL   Biochem. Biophys. Res. Commun. 180 (3), 1265-1272 (1991)
  MEDLINE   92062153
   PUBMED   1719979
  REMARK    GenBank staff at the National Library of Medicine created this
            entry [NCBI gibbsq 63938] from the original journal article.
            This sequence comes from fig1.
FEATURES             Location/Qualifiers
     source          1..1868
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     gene            1..1868
                     /gene="A-type endothelin receptor, ETA"
     CDS             74..1357
                     /gene="A-type endothelin receptor, ETA"
                     /note="This sequence comes from fig1; ETA"
                     /codon_start=1
                     /product="A-type endothelin receptor"
                     /protein_id="AAB20278.1"
                     /db_xref="GI:238636"
                     /translation="METLCLRASFWLALVGCVISDNPERYSTNLSNHVDDFTTFRGTE
                     LSFLVTTHQPTNLVLPSNGSMHNYCPQQTKITSAFKYINTVISCTIFIVGMVGNATLL
                     RIIYQNKCMRNGPNALIASLALGDLIYVVIDLPINVFKLLAGRWPFDHNDFGVFLCKL
                     FPFLQKSSVGITVLNLCALSVDRYRAVASWSRVQGIGIPLVTAIEIVSIWILSFILAI
                     PEAIGFVMVPFEYRGEQHKTCMLNATSKFMEFYQDVKDWWLFGFYFCMPLVCTAIFYT
                     LMTCEMLNRRNGSLRIALSEHLKQRREVAKTVFCLVVIFALCWFPLHLSRILKKTVYN
                     EMDKNRCELLSFLLLMDYIGINLATMNSCINPIALYFVSKKFKNCFQSCLCCCCYQSK
                     SLMTSVPMNGTSIQWKNHDQNNHNTDRSSHKDSMN"
BASE COUNT      512 a    419 c    389 g    548 t
ORIGIN



  Query Match             44.3%;  Score 1819.6;  DB 9;  Length 1868;
  Best Local Similarity   99.7%;  Pred. No. 0;
  Matches 1844;  Conservative    0;  Mismatches    4;  Indels    2;  Gaps    2;
Qy    421 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     10 AAAAAGTGAAGGTGTAAAAGCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCT 69

Qy    481 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     70 CAAGATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAAT 129

Qy    541 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    130 CAGTGATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCAC 189

Qy    601 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    190 TTTTCGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCT 249

Qy    661 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    250 ACCCAGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTT 309

Qy    721 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    310 CAAATACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGC 369

Qy    781 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    370 AACTCTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGAT 429

Qy    841 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    430 AGCCAGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATT 489

Qy    901 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    490 TAAGCTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAA 549

Qy    961 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    550 GCTGTTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCT 609

Qy   1021 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    610 TAGTGTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCC 669

Qy   1081 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    670 TTTGGTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCC 729

Qy   1141 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    730 TGAAGCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTG 789

Qy   1201 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    790 TATGCTCAATGCCACATCAAAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCT 849

Qy   1261 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    850 CTTCGGGTTCTATTTCTGTATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGAC 909

Qy   1321 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    910 TTGTGAGATGTTGAACAGAAGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAA 969

Qy   1381 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    970 GCAGCGTCGAGAAGTGGCAAAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTG 1029

Qy   1441 GTTCCCTCTTCACTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAA 1500
          |||||||||||| ||||||||||||||||||||||||||||||||||| |||||||||||
Db   1030 GTTCCCTCTTCATTTAAGCCGTATATTGAAGAAAACTGTGTATAACGAGATGGACAAGAA 1089

Qy   1501 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1090 CCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAAC 1149

Qy   1561 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1150 CATGAATTCATGTATAAACCCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTG 1209

Qy   1621 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1210 TTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCC 1269

Qy   1681 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1270 CATGAACGGAACAAGCATCCAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCG 1329

Qy   1741 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1330 GAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCAT 1389

Qy   1801 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGACTCCGGGAATCTCTTCTCTGATC 1860
          ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db   1390 AATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGAGTCCGGGAATCTCTTCTCTGATC 1449

Qy   1861 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAA-GGTAG 1919
          |||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||
Db   1450 CTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGGTAG 1509

Qy   1920 ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1979
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1510 ACTGGTTTATCCACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACAT 1569

Qy   1980 ATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGT 2039
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1570 ATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGT 1629

Qy   2040 TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 2099
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1630 TAAATGAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATG 1689

Qy   2100 GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 2159
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1690 GCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTA 1749

Qy   2160 GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGA-TTTTTTGTCCTTCAGCCAAACA 2218
          ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db   1750 GAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTTGTCCTTCAGCCAAACA 1809

Qy   2219 CAATATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 2268
          ||||||||||||| ||||||||||||||||||||||||||||||||||||
Db   1810 CAATATGGGCTCAAGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTAT 1859



Search completed: December 12, 2003, 19:39:03 
Job time : 14421.7 secs
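
The percentage columns printed with each hit appear to follow directly from the other reported figures. The sketch below is an assumption inferred from the numbers in this listing, not from GenCore documentation: Query Match is taken as Score divided by Perfect score, and Best Local Similarity as Matches divided by the aligned columns (Matches + Mismatches + Indels). The function names are hypothetical.

```python
def query_match(score: float, perfect_score: float) -> float:
    """Percent of the perfect (self-alignment) score reached by a hit.

    Assumed formula, inferred from the listing: 100 * Score / Perfect score.
    """
    return 100.0 * score / perfect_score


def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent identity over all aligned columns of the local alignment.

    Assumed formula, inferred from the listing:
    100 * Matches / (Matches + Mismatches + Indels).
    """
    return 100.0 * matches / (matches + mismatches + indels)


if __name__ == "__main__":
    # Figures reported above for RESULT 15 (S63938), perfect score 4105.
    print(round(query_match(1819.6, 4105), 1))
    print(round(best_local_similarity(1844, 4, 2), 1))
```

Under these assumptions the RESULT 15 figures (Score 1819.6, Matches 1844, Mismatches 4, Indels 2) reproduce the printed 44.3% and 99.7%.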



GenCore version 5.1.6
Copyright (c) 1993 - 2003 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         December 12, 2003, 09:17:01 ; Search time 7395.45 Seconds
                (without alignments)
                13490.718 Million cell updates/sec

Title:          US-09-931-157-1
Perfect score:  4105
Sequence:       1 gaattcgcggccgcctcttg taaaagtttacagaaacctt 4105

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       22781392 seqs, 12152238056 residues

Total number of hits satisfying chosen parameters:      45562784

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      EST:*
                1:  em_estba:*
                2:  em_esthum:*
                3:  em_estin:*
                4:  em_estmu:*
                5:  em_estov:*
                6:  em_estpl:*
                7:  em_estro:*
                8:  em_htc:*
                9:  gb_est1:*
                10:  gb_est2:*
                11:  gb_htc:*
                12:  gb_est3:*
                13:  gb_est4:*
                14:  gb_est5:*
                15:  em_estfun:*
                16:  em_estom:*
                17:  em_gss_hum:*
                18:  em_gss_inv:*
                19:  em_gss_pln:*
                20:  em_gss_vrt:*
                21:  em_gss_fun:*
                22:  em_gss_mam:*
                23:  em_gss_mus:*
                24:  em_gss_pro:*
                25:  em_gss_rod:*
                26:  em_gss_phg:*
                27:  em_gss_vrl:*
                28:  gb_gss1:*
                29:  gb_gss2:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

SUMMARIES

Result           Query
  No.    Score   Match  Length  DB  ID        Description

    1     1069   26.0    3538  11  AK043210  AK043210 Mus muscul
    2    797.8   19.4     865   9  AU118882  AU118882 AU118882
    3    792.2   19.3     863   9  AU119546  AU119546 AU119546
    4      789   19.2     869   9  AU120519  AU120519 AU120519
    5    752.2   18.3     783  13  BX109539  BX109539 BX109539
    6    750.6   18.3    3150  11  AK042211  AK042211 Mus muscul
c   7    725.6   17.7     781   9  AI936539  AI936539 wd99h09.x
    8    710.4   17.3     733  12  BM719244  BM719244 TTT-F-Fm-
c   9    704.2   17.2     777  12  BQ018994  BQ018994 UI-H-DN1-
   10    695.8   17.0     764   9  AU139639  AU139639 AU139639
   11    694.8   16.9     782  14  CD246193  CD246193 AGENCOURT
c  12    694.4   16.9     739   9  AI694278  AI694278 wd4S^r)1 x
c  13    692.2   16.9     732  13  BQ575745  BQ575745 TTT-H-F71-
c  14    687.8   16.8     747   9  AI809396  AI809396 wf70b01.x
c  15      685   16.7     705  12  BQ006584  BQ006584 UI-H-KT1-
   16      683   16.6     742  10  BG116534  BG116534 602317635
   17      683   16.6     771  10  BG699589  BG699589 602679314
c  18    681.4   16.6     703  13  BU681421  BU681421 UI-CF-EC1
c  19      673   16.4     677   9  AI130721  AI130721 ac16e12.x
c  20      661   16.1     661  13  BU948045  BU948045 io49d10.x
c  21    653.4   15.9     724   9  AI953824  AI953824 wx69h09.x
   22    650.6   15.8    1072  12  BM548868  BM548868 AGENCOURT
   23    649.8   15.8     708   9  AU117099  AU117099 AU117099
c  24    647.6   15.8     679  13  BU687640  BU687640 TTT-PF-Fri
   25    640.8   15.6    1627  11  AK017486  AK017486 Mus muscul
c  26      627   15.3     700   9  AI913052  AI913052 tz75f01.x
c  27    621.6   15.1     678   9  AI804414  AI804414 fr71a0^ x
   28      611   14.9     652  10  BF984079  BF984079 602307S1S
c  29      610   14.9     637  12  BM998050  BM998050 UI-H-DT1-
   30    609.4   14.8     978  14  CD359210  CD359210 AGENCOURT
   31    603.2   14.7    1043  10  BG114663  BG114663 602315469
c  32      603   14.7     621  12  BM683196  BM683196 UI-E-E01-
   33    596.8   14.5     918  10  BF980903  BF980903 602304437
c  34    595.4   14.5     674   9  AI417092  AI417092 tg78g05.x
c  35      595   14.5     595  12  BM512298  BM512298 ij70c02.x
c  36    582.2   14.2     645  10  BE813325  BE813325 RC3-BN003
c  37      569   13.9     609  10  BE328379  BE328379 hs93g06.x
c  38    564.8   13.8     611  13  BU681771  BU681771 UI-CF-EC1
   39    563.2   13.7     676  10  BG548379  BG548379 602575155
c  40    561.8   13.7     582   9  AI144480  AI144480 qb73e08.x
   41    554.4   13.5     578  13  BU948316  BU948316 io49d10.y
c  42    552.6   13.5     572   9  AI240165  AI240165 qh39f10.x
   43    534.8   13.0     595   9  AA234178  AA234178 zr49f02.r
   44    534.8   13.0     822  14  CD109654  CD109654 AGENCOURT
c  45    534.4   13.0     560   9  AU145859  AU145859 AU145859



ALIGNMENTS 



RESULT 1
AK043210
LOCUS       AK043210                3538 bp    mRNA    linear   HTC 05-DEC-2002
DEFINITION  Mus musculus 7 days neonate cerebellum cDNA, RIKEN full-length
            enriched library, clone:A730071B21 product:SIMILAR TO ENDOTHELIN
            RECEPTOR TYPE A, full insert sequence.
ACCESSION   AK043210
VERSION     AK043210.1  GI:26335584
KEYWORDS    HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
  MEDLINE   99279253
   PUBMED   10349636
REFERENCE   2
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new genes
  JOURNAL   Genome Res. 10 (10), 1617-1630 (2000)
  MEDLINE   20499374
   PUBMED   11042159
REFERENCE   3
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P.,
            Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M.,
            Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A.,
            Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K.,
            Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M.,
            Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J.,
            Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.
  TITLE     RIKEN integrated sequence analysis (RISA) system--384-format
            sequencing pipeline with 384 multicapillary sequencer
  JOURNAL   Genome Res. 10 (11), 1757-1771 (2000)
  MEDLINE   20530913
   PUBMED   11076861
REFERENCE   4
  AUTHORS   Kawai,J., Shinagawa,A., Shibata,K., Yoshino,M., Itoh,M., Ishii,Y.,
            Arakawa,T., Hara,A., Fukunishi,Y., Konno,H., Adachi,J., Fukuda,S.,
            Aizawa,K., Izawa,M., Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka,I.,
            Saito,T., Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito,R.,
            Kadota,K., Matsuda,H., Ashburner,M., Batalov,S., Casavant,T.,
            Fleischmann,W., Gaasterland,T., Gissi,C., King,B., Kochiwa,H.,
            Kuehl,P., Lewis,S., Matsuo,Y., Nikaido,I., Pesole,G.,
            Quackenbush,J., Schriml,L.M., Staubli,F., Suzuki,R., Tomita,M.,
            Wagner,L., Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H.,
            Baldarelli,R., Barsh,G., Blake,J., Boffelli,D., Bojunga,N.,
            Carninci,P., de Bonaldo,M.F., Brownstein,M.J., Bult,C.,
            Fletcher,C., Fujita,M., Gariboldi,M., Gustincich,S., Hill,D.,
            Hofmann,M., Hume,D.A., Kamiya,M., Lee,N.H., Lyons,P.,
            Marchionni,L., Mashima,J., Mazzarelli,J., Mombaerts,P., Nordone,P.,
            Ring,B., Ringwald,M., Rodriguez,I., Sakamoto,N., Sasaki,H.,
            Sato,K., Schonbach,C., Seya,T., Shibata,Y., Storch,K.F., Suzuki,H.,
            Toyo-oka,K., Wang,K.H., Weitz,C., Whittaker,C., Wilming,L.,
            Wynshaw-Boris,A., Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S.
            and Hayashizaki,Y.
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409 (6821), 685-690 (2001)
  MEDLINE   21085660
   PUBMED   11217851
REFERENCE   5
  AUTHORS   The FANTOM Consortium and the RIKEN Genome Exploration Research
            Group Phase I & II Team.
  TITLE     Analysis of the mouse transcriptome based on functional annotation
            of 60,770 full-length cDNAs
  JOURNAL   Nature 420, 563-573 (2002)
REFERENCE   6  (bases 1 to 3538)
  AUTHORS   Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P.,
            Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume,W.,
            Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T.,
            Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T.,
            Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M.,
            Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Murata,M.,
            Nakamura,M., Nishi,K., Nomura,K., Numazaki,R., Ohno,M., Ohsato,N.,
            Okazaki,Y., Saito,R., Saitoh,H., Sakai,C., Sakai,K., Sakazume,N.,
            Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T.,
            Sogabe,Y., Tagami,M., Tagawa,A., Takahashi,F., Takaku-Akahira,S.,
            Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A.,
            Muramatsu,M. and Hayashizaki,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JUL-2001) Yoshihide Hayashizaki, The Institute of
            Physical and Chemical Research (RIKEN), Laboratory for Genome
            Exploration Research Group, RIKEN Genomic Sciences Center (GSC),
            RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama,
            Kanagawa 230-0045, Japan (E-mail:genome-res@gsc.riken.go.jp,
            URL:http://genome.gsc.riken.go.jp/, Tel:81-45-503-9222,
            Fax:81-45-503-9216)
COMMENT     cDNA library was prepared and sequenced in Mouse Genome
            Encyclopedia Project of Genome Exploration Research Group in Riken
            Genomic Sciences Center and Genome Science Laboratory in RIKEN.
            Division of Experimental Animal Research in Riken contributed to
            prepare mouse tissues.
            Please visit our web site for further details.
            URL:http://genome.gsc.riken.go.jp/
            URL:http://fantom.gsc.riken.go.jp/.
FEATURES             Location/Qualifiers
     source          1..3538
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL/6J"
                     /db_xref="FANTOM_DB:A730071B21"
                     /db_xref="taxon:10090"
                     /clone="A730071B21"
                     /tissue_type="cerebellum"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /dev_stage="7 days neonate"
     CDS             310..1593
                     /note="unnamed protein product; SIMILAR TO ENDOTHELIN
                     RECEPTOR TYPE A (SPTR|Q91W2, evidence: FASTY, 97.7%ID,
                     72.1%length, match=924)
                     putative"
                     /codon_start=1
                     /protein_id="BAC31493.1"
                     /db_xref="GI:26335585"
                     /translation="MSIFCLAAYFWLTMVGGVMADNPERYSANLSSHMEDFTPFPGTE
                     INFLGTTHRPPNLALPSNGSMHGYCPQQTKITTAFKYINTVISCTIFIVGMVGNATLL
                     RIIYQNKCMRNGPNALIASLALGDLIYVVIDLPINVFKLLAGRWPFDHNDFGVFLCKL
                     FPFLQKSSVGITVLNLCALSVDRYRAVASWSRVQGIGIPLITAIEIVSIWILSFILAI
                     PEAIGFVMVPFEYKGELHRTCMLNATSKFMEFYQDVKDWWLFGFYFCMPLVCTAIFYT
                     LMTCEMLNRRNGSLRIALSEHLKQRREVAKTVFCLVVIFALCWFPLHLSRILKKTVYD
                     EMDKNRCELLSFLLLMDYIGINLATMNSCINPIALYFVSKKFKNCFQSCLCCCCHQSK
                     SLMTSVPMNGTSIQWKNQEQNNHNTERSSHKDSMN"
BASE COUNT      906 a    867 c    781 g    984 t
ORIGIN



  Query Match             26.0%;  Score 1069;  DB 11;  Length 3538;
  Best Local Similarity   73.9%;  Pred. No. 8.3e-198;
  Matches 1501;  Conservative    0;  Mismatches  470;  Indels   60;  Gaps    9;



Qy 140 CCCGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATCCATCCCACCCGGTCG 199

1 1 II MM! I I MM Ml Mill Ml II II Mill I I

Db 5 CCTTCTAGTCTGGCAACTGTGTCTAGGAGGTGGGGAGCCTCTCTCTGATCCACCGGACCA 64

Qy 200 TCGCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAGTGCCCAGGAAGTTTT 259

MM II I III II III I I Mill II II I I

Db 65 TCG CTGGAGCTTG CAGG CTGAG CAAGATCTCC C CCTAGAGAAG CCT 110

Qy 260 CTGAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGAGCCCGGGACACCGGC 319

III I II III I I I II I Mill II 1 1 1 1 II

Db 111 GGCTGTCCGGGGAAGTT----TCCCCGAGCTGAGACTGTGCTGCAGCCCTGGTCACCCGC 166

Qy 320 CACCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGCCCAGGCGCCGCGCGG 379

1 1 II 1 1 llllllllll I II I III MM I IMIIII MM I

Db 167 CACCCTGCGCGCCACCCTCGTTCTCCAGCTCAGGCTCCGGCTGGCCCGTGCGC------G 220

Qy 380 ACCCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAAAGTGAAGGTGTAAAA 439

M I iiiiiiiiii ii in i nihil iii ii

Db 221 GACCTGGAGCTGTCTGCTTCCGAGGAGCTCTAAGGTGAAAAAAAGAAAGGCGTGAGACCA 280

Qy 440 GCAGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAGATGGAAACCCTTTGC 499

II I Ml II MM I Mill! I I Mill

Db 281 ACATAAGAAG ACTTAAAATC CAGGTTAAGATGAGTATCTTTTG C 324

Qy 500 CTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGTGATAATCCTGAGAGA 559

II Mill llllllllll I MM III I Mill III Mill llllll

Db 325 CTTGCGGCATACTTTTGGCTGACCATGGTGGGAGGCGTAATGGCTGACAATCCGGAGAGA 384

Qy 560 TACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTTCGTGGCACAGAGCTC 619

iiMii i 1 1 1 1 j 1 1 r 1 1 ii 1 1 1 1 ii mini 1 1 1 1 1 1 ii ii mi ii

Db 385 TACAGCGCTAATCTAAGCAGCCACATGGAAGACTTCACCCCTTTTCCGGGGACGGAGATC 444



Qy 620 AGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCCAGCAATGGCTCAATG 679



I Ml 1 1 II Mill MM MM .1 Ml II IIIMIIIIIIMM 

Db 445 AACTTTCTGGGCACCACCCATCGACCCCCTAATTTGGCCCTGCCTAGCAATGGCTCAATG 504

Qy 680 CACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTG 739

Ml MMMMMMMMIMIMM || I I I I || I I | | || | II I I I I I I II II 

Db 505 CACGGCTATTGCCCACAGCAGACTAAAATCACGACAGCTTTCAAATATATTAACACTGTG 564

Qy 740 ATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATT 799

Mill II II I 1 1 i 11 ' 1 1 ' I MINIM II I MIMI 

Db 565 ATATCCTGCACCATTTTCATCGTGGGAATGGTGGGGAACGCAACTCTACTACGAATCATT 624 

Qy 800 TACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGA 859

Mill Mill . llllllll Mill llllllll II lllllllll 

Db 625 TACCAAAACAAGTGTATGAGGAACGGCCCCAATGCGCTCATAGCCAGCCTGGCCCTTGGA 684 

Qy 860 GACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGC 919 

MIMMMII 1 1 1 1 1 1 1 1 1 M Mill Mill II llllllll MM II Ml 

Db 685 GACCTTATCTACGTGGTCATTGACCTCCCCATCAACGTGTTTAAGCTCTTGGCAGGACGC 744 

Qy 920 TGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAG 979

llllllll II llllllll Mill II Mill MMIMMIMIMM Mill 

Db 745 TGGCCTTTCGACCACAATGATTTTGGAGTGTTTCTCTGCAAGCTGTTCCCCTTCCTGCAG 804

Qy 980 AAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGA 1039 

llllllll Mill lllllllll I MIMIII Mill Mill MIMMIMM 

Db 805 AAGTCCTCCGTGGGCATCACCGTCTTGAACCTCTGTGCTCTCAGTGTGGACAGGTACAGA 864

Qy 1040 GCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAA 1099

Mill II llllllll M Mill Mill Mill II III I M 

Db 865 GCAGTGGCTTCCTGGAGCCGAGTTCAAGGAATCGGGATCCCCTTGATTACCGCCATTGAA 924 

Qy 1100 ATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTC 1159

II MMIIIMIMM II Mill Ml II II Mill II lllllllll 

Db 925 ATCGTCTCCATCTGGATTCTTTCCTTCATCTTGGCCATCCCGGAAGCAATCGGCTTCGTC 984

Qy 1160 ATGGTACCCTTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCCACATCA 1219 

MIMMMII Mill I III II I Mill Mill MIMIII Mill II 

Db 985 ATGGTACCCTTCGAATACAAGGGCGAGCTGCATAGGACCTGCATGCTCAACGCCACGTCC 1044 

Qy 1220 AAATTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGT 1279

II lllllllll MIMMMII MINI IIIMII II MIMIII Mill 

Db 1045 AAGTTCATGGAGTTTTACCAAGATGTGAAGGACTGGTGGCTCTTTGGGTTCTACTTCTGC 1104

Qy 1280 ATGCCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGA 1339

MIMMIIIIIIIIII II ii lllllllll I Mill 

Db 1105 ATGCCCTTGGTGTGCACAGCAATCTTCTACACCCTCATGACCTGTGAGATGCTCAACAGG 1164 

Qy 1340 AGGAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCA 1399

Mill 1 1 1 f 1 1 J 1 1 I II Mill Mill II M II IIIIIIIMIIIIIMM 

Db 1165 AGGAACGGCAGCTTGCGGATCGCCCTTAGTGAGCACCTCAAACAGCGTCGAGAAGTGGCA 1224 

Qy 1400 AAAACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGC 1459

II II II I II II I || II II II I II II II I :l MM 

Db 1225 AAGACTGTCTTCTGCTTGGTTGTCATCTTCGCCCTGTGCTGGTTCCCTCTTCACTTAAGC 1284

Qy 1460 CGTATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGT 1519

II II MMMIMIMM III I II Mill llllllll MIMI I II II 



Db 1285 CGCATTTTGAAGAAAACTGTATATGATGAGATGGATAAGAACCGGTGTGAACTGCTCAGC 1344



Qy 1520 TTCTTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAAC 1579

Mill Mill MINIM III Ml M II 1 1 1 1 1 1 1 1 1 M 1 1 M MUM 

Db 1345 TTCTTGCTGCTAATGGATTACATCGGCATTAACCTGGCAACCATGAATTCTTGCATAAAC 1404

Qy 1580 CCCATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGC 1639

II llllllll MIMMMMMMIMM MIMIIMM Mill n 

Db 1405 CCAATAGCTCTATATTTTGTGAGCAAGAAATTCAAAAATTGTTTTCAGTCCTGCCTCTGT 1464

Qy 1640 TGCTGCTGTTACCAGTCCAAAAGTCTGATGACCTCGGTCCCCATGAACGGAACAAGCATC 1699

Mill III MMIIMIMM II IIIMIIIIIIIIIIMMI Mill II III 

Db 1465 TGCTGTTGTCACCAGTCCAAAAGCCTCATGACCTCGGTCCCCATGAATGGAACGAGTATC 1524 

Qy 1700 CAGTGGAAGAACCACGATCAAAACAACCACAACACAGACCGGAGCAGCCATAAGGACAGC 1759

II II 1 1 1 1 1 II 1 1 1 II II f I J 1 1 1 i 1 1 1 1 1 1 1 II MIMIIMM MM 

Db 1525 CAGTGGAAGAACCAAGAGCAGAACAACCACAACACGGAACGGAGCAGCCACAAGGACAGC 1584

Qy 1760 ATGAACTGACCACCCTTAGAAGCACTCCTCGGTACTCCCATAATCCTCTCGGAGAAAAAA 1819 

lllllll III II MM III II II II I II III I 

Db 1585 ATGAACTAACCCTCCGCAGAAACACCGAGACGTGTGCCTTCAAGTCCTAGGATGGAAACA 1644
Qy 1820 ATCACAAGGCAACTGTGACTCCGGGAATCTCT--------TCTCTGATCCTTCTTCCTTA 1871

I II I II II I II II II I II I II II III II 

Db 1645 ACCATTACGCCACAGATGCGCTCCCAAAACCTCCCAAGTCTCTCCCATGCTCCTTTTCTA 1704

Qy 1872 ATTCACTCCCACACCCAAGAAGAAATGCTTTCCAAAACCGCAAGGTAGACTGGTTTATCC 1931

lllllll I II MM III III III III I 

Db 1705 AGTCCATCCTAGGAAAAGCTCTCCTGCCCTCCCAACAGCACGTGGTGGACCGGT-----C 1759

Qy 1932 ACCCACAACATCTACGAATCGTACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTT 1991 

I I llllllll MIMI MM MM II III 

Db 1760 CCAGCTATAGCCAATGGGTCTTTCCTGAGTACTGTATATGATTTGCATACCGCGCATGTC 1819 

Qy 1992 GTATTCAGCACTAAAAAATGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCA 2051

I I II MM Mill II lllllll I III II I I Ml I 

Db 1820 ATTTCCAACACTTGAAAAT-----TAGAGCTGGGAGAAAGGAGA-TGATGGTTCAAAGAA 1873

Qy 2052 GAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAGCTTTTAT 2111 

I II I I I Mill I MM II Mill MM I Mill I 

Db 1874 GCCACCTAGCTGCCGCCTTTGCATGAACACAGAGTTTGCAAGTTCATGACCAGCTTCCGT 1933

Qy 2112 GGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAG 2162 

llllllll M lllllll. II I I II I Mill 

Db 1934 -GCAGTTCTATGGACCAGCTGGTGGGAACTGTCCATCCTAAGATTCTAGAG 1983 



RESULT 2
AU118882
LOCUS       AU118882                 865 bp    mRNA    linear   EST 01-AUG-2002
DEFINITION  AU118882 HEMBA1 Homo sapiens cDNA clone HEMBA1004569 5', mRNA
            sequence.
ACCESSION   AU118882
VERSION     AU118882.1  GI:10934117
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 865)
  AUTHORS   Ota,T., Nishikawa,T., Suzuki,Y., Ishii,S., Saito,K., Kawai,Y.,
            Yamamoto,J., Wakamatsu,A., Nakamura,Y., Nagai,T., Sugano,S. and
            Isogai,T.
  TITLE     HRI human cDNA project
  JOURNAL   Unpublished
COMMENT     Contact: Takao Isogai
            Genomics Laboratory
            Helix Research Institute
            1532-3 Yana, Kisarazu, Chiba 292-0812, Japan
            Tel: 81-438-52-3975
            Fax: 81-438-52-3986
            Email: genomics@hri.co.jp
            HRI human cDNA project; 5'- & 3'-end one pass sequencing: Helix
            Research Institute; cDNA library construction: Department of
            Virology, Institute of Medical Science, University of Tokyo, and
            Helix Research Institute.
FEATURES             Location/Qualifiers
     source          1..865
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="HEMBA1004569"
                     /tissue_type="whole embryo, mainly head"
                     /dev_stage="embryo, 10 weeks"
                     /clone_lib="HEMBA1"
                     /note="Vector: pME18SFL3"
BASE COUNT      271 a    184 c    158 g    247 t      5 others
ORIGIN



Query Match 19.4%; Score 797.8; DB 9; Length 865;

Best Local Similarity 98.0%; Pred. No. 6.1e-145;

Matches 847; Conservative 0; Mismatches 12; Indels 5; Gaps 4;



Qy 1479 TGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATT 1538
        || |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 TGNATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTCTTACTGCTCATGGATT 60

Qy 1539 ACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCCATAGCTCTGTATTTTG 1598
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 ACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCCATAGCTCTGTATTTTG 120

Qy 1599 TGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCA 1658
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 TGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAGTCCA 180

Qy 1659 AAAGTCTGATGACCTCGGTCCCCATGAACGGAACAAGCATCCAGTGGAAGAACCACGATC 1718
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 AAAGTCTGATGACCTCGGTCCCCATGAACGGAACAAGCATCCAGTGGAAGAACCACGATC 240

Qy 1719 AAAACAACCACAACACAGACCGGAGCAGCCATAAGGACAGCATGAACTGACCACCCTTAG 1778
        ||||||||||||||||||||||||||||||||||||||||||
Db  241 AAAACAACCACAACACAGACCGGAGCAGCCATAAGGACAGCA 300

Qy 1779 AAGCACTCCTCGGTACTCCCATAATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGAC 1838
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 AAGCACTCCTCGGTACTCCCATAATCCTCTCGGAGAAAAAAATCACAAGGCAACTGTGAG 360

Qy 1839 TCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATG 1898
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 TCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAATTCACTCCCACACCCAAGAAGAAATG 420

Qy 1899 CTTTCCAAAACCGCAA-GGTAGACTGGT^ 1957
        |||||||||||||||| |||||||||||||||||||||||||||||||||||||||||||
Db  421 CTTTCCAAAACCGCAAGGGTAGACTGGTTTATCCACCCACAACATCTACGAATCGTACTT 480

Qy 1958 CTTTAATTGATCTAATTTACATATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGA 2017
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 CTTTAATTGATCTAATTTACATATTCTGCGTGTTGTATTCAGCACTAAAAAATGGTGGGA 540

Qy 2018 GCTGGGGGAGAATGAAGACTGTTAAATGAAACCAGAAGGATATTTACTACTTTTGCATGA 2077
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 GCTGGGGGAGAATGAAGACTGTTAAATGAAACCAGAAGGATATTTACTACTTTTGCATGA 600

Qy 2078 AAATAGAGCTTTCAAGTACATGGCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGG 2137
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 AAATAGAGCTTTCAAGTACATGGCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAATGGG 660

Qy 2138 AACTGGTCACCATGAAACTTTAGAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGA- 2196
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 AACTGGTCACCATGAAACTTTAGAGATTAACGACAAGATTTTCTACTTTTTTTAAGTGAT 720

Qy 2197
Db  721

Qy 2257 TGGTGCCAGTATTTTTTAACTGCATAATAGCCTAACATG--ATTATTTGAACTTATTTA- 2313
        |||||||||||||||||||||||||||||||||||||||   |  || |||||||||||
Db  781 TGGTGCCAGTATTTTTTAACTGCATAATAGCCTAACATGGATTATTTGGAACTTATTTAC 840

Qy 2314
Db  841



RESULT 3 
AU119546 
LOCUS AU119546 863 bp mRNA linear EST 01-AUG-2002
DEFINITION AU119546 HEMBA1 Homo sapiens cDNA clone HEMBA1006071 5', mRNA
sequence.
ACCESSION AU119546
VERSION AU119546.1 GI:10934781
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 863)
AUTHORS Ota,T., Nishikawa,T., Suzuki,Y., Ishii,S., Saito,K., Kawai,Y.,
Yamamoto,J., Wakamatsu,A., Nakamura,Y., Nagai,T., Sugano,S. and
Isogai,T.
TITLE HRI human cDNA project
JOURNAL Unpublished
COMMENT Contact: Takao Isogai
Genomics Laboratory
Helix Research Institute
1532-3 Yana, Kisarazu, Chiba 292-0812, Japan
Tel: 81-438-52-3975
Fax: 81-438-52-3986
Email: genomics@hri.co.jp
HRI human cDNA project; 5'- & 3'-end one pass sequencing: Helix
Research Institute; cDNA library construction: Department of
Virology, Institute of Medical Science, University of Tokyo, and
Helix Research Institute.
FEATURES Location/Qualifiers
source 1..863
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="HEMBA1006071"
/tissue_type="whole embryo, mainly head"
/dev_stage="embryo, 10 weeks"
/clone_lib="HEMBA1"
/note="Vector: pME18SFL3"
BASE COUNT 285 a 130 c 130 g 316 t 2 others
ORIGIN



Query Match 19.3%; Score 792.2; DB 9; Length 863;

Best Local Similarity 97.8%; Pred. No. 7.6e-144;

Matches 845; Conservative 0; Mismatches 14; Indels 5; Gaps 4;



Qy 2045 GAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAG 2104
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAAACCAGAAGGATATTTACTACTTTTGCATGAAAATAGAGCTTTCAAGTACATGGCTAG 60



Qy 2105 CTTTTATGGCAGTTCTGGTGAATGTTCAATGGGAACTGGTCACCATGAAACTTTAGAGAT 2164 

M I MM M IM M IM M M, I Mill Ml II MMI I M 

Db 61 CTTTTATGG CAGTT CTGGTGAATGTT CAATGGGAACTGGTCACCATGAAACTTTAGAGAT 12 0 

Qy 2165 TAACGACAAGATTTTCTACTTTTTTTAAGTGA - - TTTTTTGTCCTTCAGCCAAACACAAT 2222 

I II I Ml I II M MM I I M MMI II IMM M 

Db 121 TAACGACAAGATTTTCTACTTTTTTTAAGTGATTTTTTTTGTCCTTCAGCCAAACACAAT 180 

Qy 2223 ATGGGCTCAGGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATA 2282 

MMIIIII MM II I M MMI IM II MMIIMM M I 

Db 181 ATGGGCTCAAGTCACTTTTATTTGAAATGTCATTTGGTGCCAGTATTTTTTAACTGCATA 24 0 

Qy 2283 ATAGCCTAACATGATTATTTGAACTTATTTACACATAGTTTGAAAAAAAAAAGACAAA^ 2342 

I Ml II ,i MIMI IMM MMM IMM M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 241 ATAG CCTAACATGATTATTTGAACTTATTTACACATAGTTTG - AAAAAAAAAGACAAAAA 299 

Qy 2343 TAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTATTTATTTTTTTAAAAC 24 02 

MMMM II II M MM M II I Ml M I 1 1 1 ! 1 1 1 1 1 ! 1 1 1 1 1 1 

Db 3 00 TAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGTCACTGTTTATTTTTTTAAAAC 35 9 

Qy 24 03 ACAAATTCTAAAGCTACAACAAATACTACA.GGCCCTTAAAGCAC^ 2462 

IMMIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 360 ACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAAAGCACAGTCTGATGAC^ 419 



Qy 24 63 TGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAAT 2 522 

IIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIMIIMIIIII 

Db 420 TGGCAGTTTAATAGATGTTACTCAAAGAATTTTTTAAGAACTGTATTTTATTTTTTAAAT 47 9 

Qy 2523 GGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTT 2582 

! 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 111 li 1 1 1 1 1 II I II 1 1 Ml 1 1 1 1 1 .11 1 1 I II I 

Db 48 0 GGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATGTTAAATTCAAAAGTAATGCTT 539 

Qy 2583 CAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTTTCATGTAAATTTTGTATGAAA 2642 

M II hill II M II II"! I II I MINI II 'II, J 

Db 54 0 CAATCAGATAGTTCTTTTTCACAAGTTCAAT - CTGTTTTTCATGTAAATTTTGTATGAAA 5 98 

Qy 2643 AATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTC 2702 

MMMMMMMMMMMMMMM MMMMI MM MMMMI MM 

Db 599 AATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCATTTAACTCTGCCTGAGACTTTC 658 

Qy 2703 AGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGAT 2762 

M M 1 mil li r hi ii i 1 ill M Mill ;i :i inn 

Db 65 9 AGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGAAAAAGATCGAATTTTTCAGAT 718 

Qy 2763 GATTCGGAAATTTT CATT CAGGTATTTGTAATAGTGACATATATATGTATATACATATCA 2822 

Mill M MMMM II M' I Ml M M II M M II I M 

Db 719 GATTCAGAAATTTTCATTCAGGTATTTGTAATAGTGACATATATATGNATATACATATCA 778 

Qy 2823 CCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGCAGT-AAGTCTTTTTTGATCAT 2881 

IIIIMIIIIIIIIIIIIIII MINIM I f 1 1 1 1 1 1 1 1 1 III IIIIIIMI III 

Db 779 CCTCCTATTCTCTTAATTTTTCTTAAAATGGTAACTGGCAGTAAAGCCTTTTTTGACCAT 838 

Qy 2882 TCCCTTTTCCATATAGGAAACATA 2 905 

Mill I MM II 

Db 839 TCCCTTTTCATTTAGGGAAACTTA 862 



RESULT 4 
AU120519 
LOCUS AU120519 869 bp mRNA linear EST 01-AUG-2002
DEFINITION AU120519 HEMBB1 Homo sapiens cDNA clone HEMBB1000846 5', mRNA
sequence.
ACCESSION AU120519
VERSION AU120519.1 GI:10935754
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 869)
AUTHORS Ota,T., Nishikawa,T., Suzuki,Y., Ishii,S., Saito,K., Kawai,Y.,
Yamamoto,J., Wakamatsu,A., Nakamura,Y., Nagai,T., Sugano,S. and
Isogai,T.
TITLE HRI human cDNA project
JOURNAL Unpublished
COMMENT Contact: Takao Isogai
Genomics Laboratory
Helix Research Institute
1532-3 Yana, Kisarazu, Chiba 292-0812, Japan
Tel: 81-438-52-3975
Fax: 81-438-52-3986
Email: genomics@hri.co.jp
HRI human cDNA project; 5'- & 3'-end one pass sequencing: Helix
Research Institute; cDNA library construction: Department of
Virology, Institute of Medical Science, University of Tokyo, and
Helix Research Institute.
FEATURES Location/Qualifiers
source 1..869
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="HEMBB1000846"
/tissue_type="whole embryo, mainly body"
/dev_stage="embryo, 10 weeks"
/clone_lib="HEMBB1"
/note="Vector: pME18SFL3"
BASE COUNT 277 a 153 c 148 g 289 t 2 others

ORIGIN 

Query Match 19.2%; Score 789; DB 9; Length 869; 

Best Local Similarity 98.0%; Pred. No. 3.2e-143; 

Matches 850; Conservative 0; Mismatches 12; Indels 5; Gaps 5; 
Qy 1771 ACCCTTAGAAGCACTCCTCGGTACTCCCATAATCCTCTCGGAGAAAAAAATCACAAGGCA 1830 

IIMMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIII 

Db 1 ACCCTTAGAAGCACTCCTCGGTACTCCCATAATCCTCTCGGAGAAAAAAATCACAAGGCA 60 

Qy 1831 ACTGTGACTCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAATTCACTCCCACACCCAAG 18 90 

IMIIM IIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIII 

Db 61 ACTGTGAGTCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAATTCACTCCCACACCCAAG 12 0 

Qy 18 91 AAGAAATGCTTTCGAAAACCGCAA-GGTAGACTGGTT^ 194 9 

I M M II II ,11- I II IIMM IM MMMM.I IMM 

Db 121 AAGAAATGCTTTCC^UW^CCGCAAGGGTAGACTGGTTTATCCACCCAC^CATCTACGAA 18 0 

Qy 195 0 TCGTACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTTGTATTCAGCACTAAAAAA 2 009 

IIIIIIMIMMMIIIIIIIMIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 181 TCGTACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTTGTATTCAGCACTAAAAAA 24 0 

Qy 2 010 TGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCAGAAGGATATTTACTACTT 2 069 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 241 TGGTGGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCAGAAGGATATTTACTACTT 3 00 

Qy 2 07 0 TTGCATGAAAATAGAGCTTTCAAGTACATGGCTAGCTTTTATGGCAGTTCTGGTGAATGT 212 9 

IMIIIIIIMIIIIMIIIIIMIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 3 01 TTG CATGAAAATAGAG CTTTCAAGTACATGGCTAG CTTTTATGG CAGTTCTGGTGAATGT 360 

Qy 213 0 TCAATGGGAACTGGTCACCATGAAACTTTAGAGATTAACGACAAGATTTTCTACTTTTTT 218 9 

M IMM IM M II I Ml II Mill I III Mill II I 

Db 361 TCAATGGGAACTGGTCACCATGAAACTTTAGAGATTAACGACAAGATTTTCTACTTTTTT 420 

Qy 2190 TAAGTGA - TTTTTTGTCCTTCAGCCAAACACAATATGGGCTCAGGTCACTTTTATTTGAA 2248 

IMIIM I II M II M II II II MM M II Ml 

Db 421 TAAGTGATTTTTTTGTCCTTCAGCCAAACACAATATGGGCTCAAGTCACTTTTATTTGAA 480 

Qy 224 9 ATGTCATTTGGTGCCAGTATTTTTTAACTGCATAATAGCCTAACATGATTATTTGAACTT 23 08 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 481 ATGTCATTTGGTGCCAGTATTTTTTAACTGCATAATAGCCTAACATGATTATTTGAACTT 54 0 



Qy 2309 ATTTACACATAGTTTGAAAAAAAAAAGACAAAAATAGTATTCAGGTGAGCAATTAGATTA 2368 

M MMM MM MM MMMM I MM M MMM MM 

Db 541 ATTTACACATAGTTTGAAAAAAAAAAGACAAAAATAGTATT^ 600 

Qy 2369 GTATTTTCCACGTCACTATTTATTTTTTTAAAACACAAATTC^ 2428 

I ! 1 1 1 1 1 1 M 1 1 1 II I III lllllllllll III lllllllllllllllll 

Db 601 GTATTTTCCACGTCACTGGTTATTTTTTTAAAACAQ^^ 660 

Qy 242 9 TACAGGCCCTTAAAGCACAGTCTGATGACACATTTGGCAGTTTAATAGATGTTACTCAAA 24 88 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIII 

Db 661 TACAGGCCCTTAAAGCACAGTCTGATGACACA.TTTGGCAGTTTAATAGATGTTACTCAAA 72 0 

Qy 24 89 GAATTTTTTAAGAACTGTATTTTATTTTTTAAATGGTGTTTTATTACAAGGGACCTTGAA 2548 

lllllllllllllllll M MM' IIIIIIIIMIIIIIIIIIIIIIIIII I 

Db 721 GAATTTTTTAAGAACTGNATTTTATTTTTTGAATGGTGTTTTATTACAAGGGACCTTG - A 77 9 

Qy 254 9 CATGTTTTGTATGTTAAATTCAAAAGTAATGCTTCAATCAGATAGTTCTTTTTCACAAGT 2608 

lllllllll III llllllllllllllllllll lllllllllllll lllllllllll 

Db 780 CATGTTTTGGATGGTAAATTCAAAAGTAATGCTTTAATCAGATAGTTC - TTTTCACAAGT 838 

Qy 2609 TCAATACTGTTTTTCATGTAAATTTTG 2635 

Mill III IMIII llllll | 
Db 839 TCAAT- CTGGTTTTCATGTAAATTTGG 864 
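
The summary figures printed for each result in this listing appear to be related by simple arithmetic: "Query Match" equals the score divided by the perfect score of 4105 given in the run header, and "Best Local Similarity" equals the matches divided by the total of matches, mismatches and indels. This relationship is inferred from the numbers in this listing, not taken from the search tool's documentation. The sketch below recomputes both percentages under that assumption; the function names are hypothetical.

```python
# Recompute the per-result summary percentages from the figures in this listing.
# Assumption (inferred from the listing, not from tool documentation):
#   Query Match           = 100 * score / perfect score
#   Best Local Similarity = 100 * matches / (matches + mismatches + indels)

PERFECT_SCORE = 4105  # "Perfect score" reported in the run header


def query_match(score, perfect=PERFECT_SCORE):
    """Percentage of the perfect score reached by this hit, to one decimal."""
    return round(100.0 * score / perfect, 1)


def best_local_similarity(matches, mismatches, indels):
    """Percentage of aligned columns that are identical, to one decimal."""
    return round(100.0 * matches / (matches + mismatches + indels), 1)


# Figures copied from the summary lines of Results 2 to 7.
results = {
    "AU118882": dict(score=797.8, matches=847, mismatches=12, indels=5),
    "AU119546": dict(score=792.2, matches=845, mismatches=14, indels=5),
    "AU120519": dict(score=789.0, matches=850, mismatches=12, indels=5),
    "BX109539": dict(score=752.2, matches=777, mismatches=6, indels=3),
    "AK042211": dict(score=750.6, matches=1029, mismatches=289, indels=45),
    "AI936539": dict(score=725.6, matches=748, mismatches=21, indels=1),
}

for accession, r in results.items():
    qm = query_match(r["score"])
    bls = best_local_similarity(r["matches"], r["mismatches"], r["indels"])
    print(f"{accession}: Query Match {qm}%; Best Local Similarity {bls}%")
```

Under this assumption the recomputed values agree with the percentages printed for Results 2 to 7, which also gives a quick consistency check on summary lines damaged during extraction.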



RESULT 5 
BX109539 
LOCUS BX109539 783 bp mRNA linear EST 07-FEB-2003
DEFINITION BX109539 Soares_pregnant_uterus_NbHPU Homo sapiens cDNA clone
IMAGp998G221159 ; IMAGE:487341, mRNA sequence.
ACCESSION BX109539
VERSION BX109539.1 GI:27877774
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 783)
AUTHORS Ebert,L., Heil,O., Hennig,S., Neubert,P., Partsch,E., Peters,M.,
Radelof,U., Schneider,D. and Korn,B.
TITLE Human UnigeneSet - RZPD3
JOURNAL Unpublished
COMMENT Contact: Ina Rolfs
RZPD Deutsches Ressourcenzentrum fuer Genomforschung GmbH
Im Neuenheimer Feld 580, D-69120 Heidelberg, Germany
RZPD; IMAGp998G221159.
RZPDLIB; I.M.A.G.E. cDNA Clone Collection;
Human UnigeneSet - RZPD3 (RZPDLIB No. 972)
http://www.rzpd.de/CloneCards/cgi-bin/showLib.pl.cgi/response?libNo=972
Contact: Ina Rolfs
RZPD Deutsches Ressourcenzentrum fuer Genomforschung GmbH
Heubnerweg 6, D-14059 Berlin, Germany
Tel: +49 30 32639 101
Fax: +49 30 32639 111
www.rzpd.de
This clone is available royalty-free from RZPD;
contact RZPD (clone@rzpd.de) for further information. Seq primer:
M13r, Primer sequence: TTTCACACAGGAAACAGCTATGAC.
FEATURES Location/Qualifiers
source 1..783
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGp998G221159 ; IMAGE:487341"
/sex="female"
/dev_stage="adult"
/lab_host="DH10B"
/clone_lib="Soares_pregnant_uterus_NbHPU"
/note="Organ: uterus; Vector: pT7T3-Pac; Site_1: Not I;
Site_2: Eco RI; 1st strand cDNA was primed with a Not I -
oligo(dT) primer [5'

double-stranded cDNA was ligated to Eco RI adaptors
(Pharmacia), digested with Not I and cloned into the Not I
and Eco RI sites of the modified pT7T3 vector. Library
went through one round of normalization. Library
constructed by M. Fatima Bonaldo."
BASE COUNT 217 a 139 c 155 g 269 t 3 others
ORIGIN

Query Match 18.3%; Score 752.2; DB 13; Length 783; 

Best Local Similarity 98.9%; Pred. No. 4.7e-136; 

Matches 777; Conservative 0; Mismatches 6; Indels 3; Gaps 2; 

Qy 2899 AAACATAATTTTGAAGTGGCOVGATGAGTTTATCATGTCAGTGAAAAATAATTACCCACA 2958
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 AAACATAATTTTGAAGTGGCO^GATGAGTTTATCATGTCAGTGAAAAATAATTACCCACA 60

Qy 2959 AATGCCACCAGTAACTTAACGATTCTTCACTTCTTGGGGTTTTCAGTATGAACCTAACTC 3018
        |||||||||||  |  | ||||||||||||||||||||||||||||||||||||||||||
Db   61 AATGCCACCAG--AACTTACGATTCTTCACTTCTTGGGGTTTTCAGTATGAACCTAACTC 118

Qy 3019 CCCACCCC^CATCTCCCTCCCACMTGTC^^ 3078
Db  119 CCCACCCCAACATCTCCCTCCCACATTGTCAC 178

Qy 3079 CTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAATCTTTTACTAGTGT 3138
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  179 CTGGGCATTTTCCCAGATGTTTACAGACTGTGAGTACAGCAGAAAATCTTTTACTAGTGT 238

Qy 3139 GTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTTTTCTAGACTGTCTC 3198
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  239 GTGTGTGTATATATATAAACAATTGTAAATTTCTTTTAGCCCATTTTTCTAGACTGTCTC 298

Qy 3199 TGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGTATGTATGGATTTAAT 3258
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  299 TGTGGAATATATTTGTGTGTGTGATATATGCATGTGTGTGATGGTATGTATGGATTTAAT 358

Qy 3259 CTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCTGAGCTAAAATCTAG 3318
        ||||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||
Db  359 CTAATCTAATAATTGTGCCCCGCAGTTGTGCCAAAGTGCATAGTCTGAG-TAAAATCTAG 417

Qy 3319 GTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTAGCAACCTTCTGCAT 3378
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  418 GTGATTGTTCATCATGACAACCTGCCTCAGTCCATTTTAACCTGTAGCAACCTTCTGCAT 477

Qy 3379 TCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAGGCAGCGTGAAAGCA 3438
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  478 TCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAGGCAGCGTGAAAGCA 537

Qy 3439 GATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGTTTGATAAAGCAGTA 3498
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  538 GATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGGTTGGTTTGATAAAGCAGTA 597

Qy 3499 TTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACTTTGAAGTATTATAT 3558
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  598 TTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACTTTGAAGTATTATAT 657

Qy 3559 TGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTCTGATATTTCTTTCA 3618
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  658 TGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTCTGATATTTCTTTCA 717

Qy 3619 GACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTTGTTGGGCCATATTT 3678
        ||||||||||||||||||||||||||||||||| ||||||||||||| ||||||||||||
Db  718 GACTTCGCCAGACAGATTGCTGATAATAAATTANGTAAGATAATTTGNTGGGCCATATTT 777

Qy 3679 TAGGAC 3684
        || |||
Db  778 TANGAC 783



RESULT 6 
AK042211 
LOCUS AK042211 3150 bp mRNA linear HTC 05-DEC-2002
DEFINITION Mus musculus 3 days neonate thymus cDNA, RIKEN full-length enriched
library, clone:A630071G03 product:SIMILAR TO ENDOTHELIN RECEPTOR
TYPE A, full insert sequence.
ACCESSION AK042211
VERSION AK042211.1 GI:26088888
KEYWORDS HTC; CAP trapper.
SOURCE Mus musculus (house mouse)
ORGANISM Mus musculus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE 1
AUTHORS Carninci,P. and Hayashizaki,Y.
TITLE High-efficiency full-length cDNA cloning
JOURNAL Meth. Enzymol. 303, 19-44 (1999)
MEDLINE 99279253
PUBMED 10349636
REFERENCE 2
AUTHORS Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
TITLE Normalization and subtraction of cap-trapper-selected cDNAs to
prepare full-length cDNA libraries for rapid discovery of new genes
JOURNAL Genome Res. 10 (10), 1617-1630 (2000)
MEDLINE 20499374
PUBMED 11042159
REFERENCE 3
AUTHORS Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P.,
Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M.,
Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A.,
Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K.,
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M.,
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J.,
Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.
TITLE RIKEN integrated sequence analysis (RISA) system--384-format
sequencing pipeline with 384 multicapillary sequencer
JOURNAL Genome Res. 10 (11), 1757-1771 (2000)
MEDLINE 20530913
PUBMED 11076861
REFERENCE 4
AUTHORS Kawai,J., Shinagawa,A., Shibata,K., Yoshino,M., Itoh,M., Ishii,Y.,
Arakawa,T., Hara,A., Fukunishi,Y., Konno,H., Adachi,J., Fukuda,S.,
Aizawa,K., Izawa,M., Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka,I.,
Saito,T., Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito,R.,
Kadota,K., Matsuda,H., Ashburner,M., Batalov,S., Casavant,T.,
Fleischmann,W., Gaasterland,T., Gissi,C., King,B., Kochiwa,H.,
Kuehl,P., Lewis,S., Matsuo,Y., Nikaido,I., Pesole,G.,
Quackenbush,J., Schriml,L.M., Staubli,F., Suzuki,R., Tomita,M.,
Wagner,L., Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H.,
Baldarelli,R., Barsh,G., Blake,J., Boffelli,D., Bojunga,N.,
Carninci,P., de Bonaldo,M.F., Brownstein,M.J., Bult,C.,
Fletcher,C., Fujita,M., Gariboldi,M., Gustincich,S., Hill,D.,
Hofmann,M., Hume,D.A., Kamiya,M., Lee,N.H., Lyons,P.,
Marchionni,L., Mashima,J., Mazzarelli,J., Mombaerts,P., Nordone,P.,
Ring,B., Ringwald,M., Rodriguez,I., Sakamoto,N., Sasaki,H.,
Sato,K., Schonbach,C., Seya,T., Shibata,Y., Storch,K.F., Suzuki,H.,
Toyo-oka,K., Wang,K.H., Weitz,C., Whittaker,C., Wilming,L.,
Wynshaw-Boris,A., Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S.
and Hayashizaki,Y.
TITLE Functional annotation of a full-length mouse cDNA collection
JOURNAL Nature 409 (6821), 685-690 (2001)
MEDLINE 21085660
PUBMED 11217851
REFERENCE 5
AUTHORS The FANTOM Consortium and the RIKEN Genome Exploration Research
Group Phase I & II Team.
TITLE Analysis of the mouse transcriptome based on functional annotation
of 60,770 full-length cDNAs
JOURNAL Nature 420, 563-573 (2002)
REFERENCE 6 (bases 1 to 3150)
AUTHORS Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P.,
Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume,W.,
Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T.,
Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T.,
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M.,
Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Murata,M.,
Nakamura,M., Nishi,K., Nomura,K., Numazaki,R., Ohno,M., Ohsato,N.,
Okazaki,Y., Saito,R., Saitoh,H., Sakai,C., Sakai,K., Sakazume,N.,
Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T.,
Sogabe,Y., Tagami,M., Tagawa,A., Takahashi,F., Takaku-Akahira,S.,
Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A.,
Muramatsu,M. and Hayashizaki,Y.
TITLE Direct Submission
JOURNAL Submitted (16-JUL-2001) Yoshihide Hayashizaki, The Institute of
Physical and Chemical Research (RIKEN), Laboratory for Genome
Exploration Research Group, RIKEN Genomic Sciences Center (GSC),
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama,
Kanagawa 230-0045, Japan (E-mail:genome-res@gsc.riken.go.jp,
URL:http://genome.gsc.riken.go.jp/, Tel:81-45-503-9222,
Fax:81-45-503-9216)
COMMENT cDNA library was prepared and sequenced in Mouse Genome
Encyclopedia Project of Genome Exploration Research Group in Riken
Genomic Sciences Center and Genome Science Laboratory in RIKEN.
Division of Experimental Animal Research in Riken contributed to
prepare mouse tissues.
Tissues were provided by Dr. John Todd (Dept. of Medical Genetics
Wellcome Trust Centre for Molecular Mechanisms in Disease Wellcome
Trust/MRC building Addenbrookes Hospital Cambridge) whose
assistance we gratefully acknowledge.
Please visit our web site for further details.
URL:http://genome.gsc.riken.go.jp/
URL:http://fantom.gsc.riken.go.jp/.
FEATURES Location/Qualifiers
source 1..3150
/organism="Mus musculus"
/mol_type="mRNA"
/strain="C57BL/6J"
/db_xref="FANTOM_DB:A630071G03"
/db_xref="taxon:10090"
/clone="A630071G03"
/tissue_type="thymus"
/clone_lib="RIKEN full-length enriched mouse cDNA library"
/dev_stage="3 days neonate"
misc_feature 1..3150
/note="SIMILAR TO ENDOTHELIN RECEPTOR TYPE A (SPTR|Q91W2,
evidence: FASTY, 97.7%ID, 72.1%length, match=924)"
BASE COUNT 843 a 781 c 772 g 754 t
ORIGIN

Query Match 18.3%; Score 750.6; DB 11; Length 3150; 

Best Local Similarity 75.5%; Pred. No. 9.8e-136; 

Matches 1029; Conservative 0; Mismatches 289; Indels 45; Gaps 6; 



Qy 22 GGTC CCAGAGTGGAGTGGAAGGTCTGGAG CTTTGGGAGGAGACGGGGAGGACAGACTGGA 81 

IMIIIIIIII III MINIMI I II I I II II I I III 

Db 9 GGTCCCAGAGTAGAGAGGAAGGTCTAGGAGCCTGTGGAGTCTAAGGAAGATC - -GCGGGA 66 

Qy 82 GGCGTGTTCCTCCGGAGTTTTCTTTTTCGTGCGAGCCCTCGCGCGCGCGTACAGTCATCC 141 

IMIIIIIIIIIIIIMIII Mill I II Mill Mill I II II 

Db 67 GGCGTGTTCCTCCGGAGTTTGCTTTTCCTTGGGAGCCTCGCCGCGCACACCCA TCCC 123 

Qy 142 CGCTGGTCTGACGATTGTGGAGAGGCGGTGGAGAGGCTTCATCCATCCCACCCGGTCGTC 2 01 

II Mill I I MM III Mill III II II Mill I I II 

Db 124 TTCTAGTCTGGCAACTGTGTCTAGGAGGTGGGGAGCCTCTCTCTGATCCACCGGACCATC 183 

Qy 202 GCCGGGGATTGGGGTCCCAGCGACACCTCCCCGGGAGAAGCAGTGCCCAGGAAGTTTTCT 2 61 

II II I III II Ml I I Mill II II I I 

Db 184 GCTGGAGCTTGCAGGCTGAGCAAGATCTCCC CCTAGAGAAGCCTGG 22 9 

Qy 262 GAAGCCGGGGAAGCTGTGCAGCCGAAGCCGCCGCCGCGCCGGAGCCCGGGACACCGGCCA 321 

MIMIMI I II III I I I III Mill M MM MM 

Db 230 CTGTCCGGGGAAGTT TCCCCGAGCTGAGACTGTGCTGCAGCCCTGGTCArrrnrrA 2fl S 



Qy 322 CCCTCCGCGCCACCCACCCTCGCTTTCTCCGGCTTCCTCTGGCCCAGGCGCCGCGCGGAC 381 

I I I I llllllllll I II I III I I I I I I ! I I I I I 

Db 2 86 CCCTGCGCGCCACCCTCGTTCTCCAGCTCAGGCTCCGGCTGGCCCGTGCGC GGA 339 

Qy 382 CCGGCAGCTGTCTGCGCACGCCGAGCTCCACGGTGAAAAAAAAAGTGAAGGTGTAAAAGC 441 

II I llllllllll II MINI I lllllllllll III III 

Db 34 0 CCTGGAGCTGTCTGCTTCCGAGGAGCTCTAAGGTGAAAAAAAGAAAGGCGTGAGACCAAC 3 99 

Qy 442 AGCACAAGTGCAATAAGAGATATTTCCTCAAATTTGCCTCAAGATGGAAACCCTTTGCCT 501 

I I III II 1 1 1 1 I 1 1 1 1 1 1 I I Ml 

Db 4 00 ATAAGAAG ACTTAAAATCCAGGTTAAGATGAGTATCTTTTGCCT 443 

Qy 502 CAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGTGATAATCCTGAGAGATA 561 

Mill llllllllll I 1 1 1 1 III I Mill Ml Mill llllllll 

Db 444 TGCGGCATACTTTTGG CTGACCATGGTGGGAGG CGTAATGG CTGACAATCCGGAGAGATA 503 

Qy 562 CAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTTCGTGGCACAGAGCTCAG 621 

MM I IIIMIMM II MM II llllll MUM M M III III 

Db 5 04 CAGCGCTAATCTAAG CAG C CACATGGAAGACTTCACC CCTTTTCCGGGGACGGAGATCAA 563 

Qy 622 CTTCCTGGTTACCACTCATC^ 681 

Ml MM Mill MM MM IIIIIMM III II I II I II 1 1 1 1 1 1 M I II 

Db 564 CTTTCTGGGCACCACCCATCGACCCCCTAATTTGGCCCTGCCTAGCAATGGCTCAATGCA 623 

Qy 682 CAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGAT 741 

I I II 1 1 1 M II 1 1 1 1 M I II I M 1 1 II II IIIIIIIIIMIII 

Db 624 CGGCTATTGCCCACAGCAGACTAAAATCACGACAGCTTTCAAATATATTAACACTGTGAT 683 

Qy 742 ATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTA 8 01 

ill ii ii i ii ir L inn ii i iiiiiiii ii i iiiiiiii 

Db 684 ATCCTGCACCATTTTCATCGTGGGAATGGTGGGGAACGCAACTCTACTACGAATCATTTA 743 

Qy 802 CCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGA 861 

Ml Mill MIMIIMM i 1 1 i 1 1 1 ! Mill I II MIMIIMM 

Db 744 CCAAAACAAGTGTATGAGGAACGGCCCCAATGCGCTCATAGCCAGCCTGGCCCTTGGAGA 8 03 

Qy 862 CCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTG 921 

,111 lllllllllll Mill Mill II IIIIIIII MM II Mill 

Db 8 04 CCTTATCTACGTGGTCATTGACCTCCCCATCAACGTGTTTAAGCTCTTGGCAGGACGCTG 863 

Qy 922 GCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAA 981 

MUM II IIIIIIII Mill II Mill MMMIMIMIMM II II 

Db 864 GCCTTTCGACCACAATGATTTTGGAGTGTTTCTCTGCAAGCTGTTCCCCTTCCTGCAGAA 923 

Qy 982 GTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGC 1041 

MIMI Mill Mil I IIIIIIII Mill Mill MM 11,11 

Db 924 GTCCTCCGTGGGCATCACCGTCTTGAACCTCTGTGCTCTCAGTGTGGACAGGTACAGAGC 983 

Qy 1042 AGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAAT 1101 

III II MM III I II Mill Mill Mill M III I II lllllllllll 

Db 984 AGTGGCTTCCTGGAGCCGAGTTCAAGGAATCGGGATCCCCTTGATTACCGCCATTGAAAT 1043 

Qy 1102 TGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCAT 1161 

MMMIMIMM || Mill Ml MM || Mill M MIMIIMM 

Db 1044 CGTCTCCATCTGGATTCTTTCCTTCATCTTGGCCATCCCGGAAGCAATCGGCTTCGTCAT 1103 



Qy 1162
Db 1104

Qy 1222
Db 1164

Qy 1282
Db 1224

Qy 1342 GAATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAG 1384
        ||| ||||||||| | || ||||| ||||| || || || |||
Db 1284 GAACGGCAGCTTGCGGATCGCCCTTAGTGAGCACCTCAAACAG 1326



RESULT 7 

AI936539/C 

LOCUS AI936539 781 bp mRNA linear EST 17-DEC-1999
DEFINITION wd29b09.x1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone
IMAGE:2329529 3' similar to gb:S57498 ENDOTHELIN-1 RECEPTOR
PRECURSOR (HUMAN);, mRNA sequence.
ACCESSION AI936539
VERSION AI936539.1 GI:5675409
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 781)
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
Tumor Gene Index
JOURNAL Unpublished
COMMENT Contact: Robert Strausberg, Ph.D.
Email: cgapbs-r@mail.nih.gov
This clone is available royalty-free through LLNL ; contact the
IMAGE Consortium (info@image.llnl.gov) for further information.
Insert Length: 1132 Std Error: 0.00
Seq primer: -40UP from Gibco
High quality sequence stop: 456.
FEATURES Location/Qualifiers
source 1..781
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:2329529"
/lab_host="DH10B"
/clone_lib="Soares_NFL_T_GBC_S1"
/note="Organ: pooled; Vector: pT7T3D-Pac (Pharmacia) with
a modified polylinker; Site_1: Not I; Site_2: Eco RI;
Equal amounts of plasmid DNA from three normalized
libraries (fetal lung NbHL19W, testis NHT, and B-cell
NCI_CGAP_GCB1) were mixed, and ss circles were made in
vitro. Following HAP purification, this DNA was used as
tracer in a subtractive hybridization reaction. The driver
was PCR-amplified cDNAs from pools of 5,000 clones made
from the same 3 libraries. The pools consisted of
I.M.A.G.E. clones 297480-302087, 682632-687239,
726408-728711, and 729096-731399. Subtraction by Bento
Soares and M. Fatima Bonaldo."
BASE COUNT 264 a 154 c 129 g 232 t 2 others

ORIGIN 

Query Match 17.7%; Score 725.6; DB 9; Length 781; 

Best Local Similarity 97.1%; Pred. No. 7.3e-131; 

Matches 748; Conservative 0; Mismatches 21; Indels 1; Gaps 1; 

Qy 3336 CAACCTGCCTCAGTCCATTTTAACCTGTAGCAACCTTCTGCATTCATAAATCTTGTAATC 3395
Db  773 C^AACTIGCTTCAGTCC^TTTT-ACCTGTAGCAACCGTCTGCATTCATAATT 715

Qy 3396 ATGTTACCATTACAAATGGGATATAAGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAG 3455
        ||||  ||||||| |||||||||||||||  |||||||||||||||||||||||    ||
Db  714 ATGTACCCATTACNAATGGGATATAAGAGCAAGCGTGAAAGCAGATGAGCTGTGACTAAG 655

Qy 3456 CAATATAGGGTTTTGTTTGGTTGGTTGGTTTGATAAAGCAGTATTTGGGGTCATATTGTT 3515
        ||||||||||||| ||||||||||||||||||||| ||||||||||||||||||||||||
Db  654 CAATATAGGGTTTGGTTTGGTTGGTTGGTTTGATANAGCAGTATTTGGGGTCATATTGTT 595

Qy 3516 TCCTGTGCTGGAGCAAAAGTCATTACACTTTGAAGTATTATATTGTTCTTATCCTCAATT 3575
        |||||||||||||||||||||||||||||||| |||||||||||||||||||||||||||
Db  594 TCCTGTGCTGGAGCAAAAGTCATTACACTTTGGAGTATTATATTGTTCTTATCCTCAATT 535

Qy 3576 CAATGTGGTGATGAAATTGCCAGGTTGTCTGATATTTCTTTCAGACTTCGCCAGACAGAT 3635
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  534 CAATGTGGTGATGAAATTGCCAGGTTGTCTGATATTTCTTTCAGACTTCGCCAGACAGAT 475

Qy 3636 TGCTGATAATAAATTAGGTAAGATAATTTGTTGGGCCATATTTTAGGACAGGTAAAATAA 3695
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  474 TGCTGATAATAAATTAGGTAAGATAATTTGTTGGGCCATATTTTAGGACAGGTAAAATAA 415

Qy 3696 CATCAGGTTCCAGTTGCTTGAATTGCAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGC 3755
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  414 CATCAGGTTCCAGTTGCTTGAATTGCAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGC 355

Qy 3756 AGTCAAATCTATTATTCCACTGGCGCATCATATGCAGTGATATATGCCTATAATATAAGC 3815
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  354 AGTCAAATCTATTATTCCACTGGCGCATCATATGCAGTGATATATGCCTATAATATAAGC 295

Qy 3816 CATAGGTTCACACCATTTTGTTTAGACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTT 3875
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  294 CATAGGTTCACACCATTTTGTTTAGACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTT 235

Qy 3876 CATATGAAAAAAATGCATTTTATAAATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACG 3935
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  234 CATATGAAAAAAATGCATTTTATAAATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACG 175

Qy 3936 TGCATTTTATTTATGGACTGGTAAGTAACTGTGGTTTACTAGCAGGAATATTTCCAATTT 3995
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  174 TGCATTTTATTTATGGACTGGTAAGTAACTGTGGTTTACTAGCAGGAATATTTCCAATTT 115

Qy 3996 CTACCTTTACTACATCTTTTCAACAAGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCC 4055
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  114 CTACCTTTACTACATCTTTTCAACAAGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCC 55

Qy 4056 CTGAGTTGGCAGTGGCCCATAAGTGTAAAATAAAAGTTTACAGAAACCTT 4105
        |||||||||||||||||||||||||||||||||||||| ||| || ||||
Db   54 CTGAGTTGGCAGTGGCCCATAAGTGTAAAATAAAAGTTAACAAAACCCTT 5



RESULT 8 
BM719244 

LOCUS BM719244 733 bp mRNA linear EST 01-MAR-2002 

DEFINITION UI-E-EO1-ajd-b-04-0-UI.r1 UI-E-EO1 Homo sapiens cDNA clone 
UI-E-EO1-ajd-b-04-0-UI 5', mRNA sequence. 
ACCESSION BM719244 
VERSION BM719244.1 GI:19037818 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 733) 

AUTHORS Bonaldo,M.F., Lennon,G. and Soares,M.B. 

TITLE Normalization and subtraction: two approaches to facilitate gene 
discovery 

JOURNAL Genome Res. 6 (9), 791-806 (1996) 
MEDLINE 97044477 
PUBMED 8889548 
COMMENT Contact: Soares, MB 
Coordinated Laboratory for Computational Genomics 
University of Iowa 
375 Newton Road, 4156 MEBRF, Iowa City, IA 52242, USA 
Tel: 319 335 8250 
Fax: 319 335 9565 
Email: bento-soares@uiowa.edu 
Tissue Procurement: Dr. Gregg Hageman 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Researchers may obtain clones from Research 
Genetics (www.resgen.com). 
Seq primer: M13 Reverse. 
FEATURES Location/Qualifiers 
source 1..733 
/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="UI-E-EO1-ajd-b-04-0-UI" 
/tissue_type="fetal eye" 
/dev_stage="fetal" 
/lab_host="DH10B (Life Technologies) (T1 phage resistant)" 
/clone_lib="UI-E-EO1" 
/note="Organ: eye; Vector: pT7T3-Pac (Pharmacia) with a 
modified polylinker; Site_1: EcoR I; Site_2: Not I; 
UI-E-EO1 is a normalized cDNA library containing the 
following tissue(s): fetal eye. The library was 
constructed according to Bonaldo, Lennon and Soares, 
Genome Research, 6:791-806, 1996. First strand cDNA 
synthesis was primed with an oligo-dT primer containing a 
Not I site. Double stranded cDNA was ligated to an EcoR I 
adaptor, digested with Not I, and cloned directionally 
into pT7T3-Pac vector. The oligonucleotide used to prime 
the synthesis of first-strand cDNA contains a library tag 
sequence that is located between the Not I site and the 
(dT)18 tail. The sequence tag for this library is 
CGCGTATACC. This library was created for the program, Gene 
Discovery in the Visual System, supported by National Eye 
Institute (NEI)." 

BASE COUNT 199 a 115 c 150 g 267 t 2 others 

ORIGIN 



Query Match 17.3%; Score 710.4; DB 12; Length 733; 

Best Local Similarity 99.4%; Pred. No. 6.7e-128; 

Matches 722; Conservative 0; Mismatches 3; Indels 1; Gaps 1; 



Qy 3162 TGTAAATTTCTTTTAGCCCATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTG 3221 

|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 1 TGTAAATTTCTTTTAGCCCATTTTTCTAGACTGTCTCTGTGGAATATATTTGTGTGTGTG 60 

Qy 3222 ATATATGCATGTGTGTGATGGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGC 3281 

|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 61 ATATATGCATGTGTGTGATGGTATGTATGGATTTAATCTAATCTAATAATTGTGCCCCGC 120 

Qy 3282 AGTTGTGCCAAAGTGCATAGTCTGAGCTAAAATCTAGGTGATTGTTCATCATGACAACCT 3341 

|||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||| 

Db 121 AGTTGTGCCAAAGTGCATAGTCTGAG-TAAAATCTAGGTGATTGTTCATCATGACAACCT 179 

Qy 3342 GCCTCAGTCCATTTTAACCTGTAGCAACCTTCTGCATTCATAAATCTTGTAATCATGTTA 3401 

||||||||||||||||||||||||||||||||||||| |||||||||||||||||||||| 

Db 180 GCCTCAGTCCATTTTAACCTGTAGCAACCTTCTGCATCCATAAATCTTGTAATCATGTTA 239 

Qy 3402 CCATTACAAATGGGATATAAGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATAT 3461 

|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 240 CCATTACAAATGGGATATAAGAGGCAGCGTGAAAGCAGATGAGCTGTGGACTAGCAATAT 299 

Qy 3462 AGGGTTTTGTTTGGTTGGTTGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGT 3521 

|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 300 AGGGTTTTGTTTGGTTGGTTGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGT 359 

Qy 3522 GCTGGAGCAAAAGTCATTACACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGT 3581 

|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 360 GCTGGAGCAAAAGTCATTACACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGT 419 

Qy 3582 GGTGATGAAATTGCCAGGTTGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGA 3641 

|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 420 GGTGATGAAATTGCCAGGTTGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGA 479 

Qy 3642 TAATAAATTAGGTAAGATAATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAG 3701 

|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 480 TAATAAATTAGGTAAGATAATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAG 539 

Qy 3702 GTTCCAGTTGCTTGAATTGCAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAA 3761 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 540 GTTCCAGTTGCTTGAATTGCAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAN 599 

Qy 3762 ATCTATTATTCCACTGGCGCATCATATGCAGTGATATATGCCTATAATATAAGCCATAGG 3821 

|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 600 ATCTATTATTCCACTGGCGCATCATATGCAGTGATATATGCCTATAATATAAGCCATAGG 659 










Qy 3822 TTCACACCATTTTGTTTAGACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATG 3881 

|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 660 TTCACACCATTTTGTTTAGACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATG 719 

Qy 3882 AAAAAA 3887 

 ||||| 

Db 720 NAAAAA 725 





RESULT 9 

BQ018994/C 

LOCUS BQ018994 777 bp mRNA linear EST 27-MAR-2002 
DEFINITION UI-H-DH1-awv-o-07-0-UI.s1 NCI_CGAP_DH1 Homo sapiens cDNA clone 
IMAGE:5824350 3', mRNA sequence. 
ACCESSION BQ018994 
VERSION BQ018994.1 GI:19754271 
KEYWORDS EST. 
SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 777) 
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 
Tumor Gene Index 
JOURNAL Unpublished 
COMMENT Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Dr. Jose Mercuende 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Seq primer: M13 FORWARD 
POLYA=Yes. 
FEATURES Location/Qualifiers 
source 1..777 
/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="IMAGE:5824350" 
/tissue_type="Metastatic Chondrosarcoma" 
/dev_stage="Adult" 
/lab_host="DH10B (Life Technologies)" 
/clone_lib="NCI_CGAP_DH1" 
/note="Organ: Lung; Vector: pT7T3-Pac (Pharmacia) with a 
modified polylinker; Site_1: EcoR I; Site_2: Not I; 
NCI_CGAP_DH1 is a normalized cDNA library containing the 
following tissue(s): VS-8 Cell line from Metastatic 
Chondrosarcoma in Lung. The library was constructed 
according to Bonaldo, Lennon and Soares, Genome Research, 
6:791-806, 1996. First strand cDNA synthesis was primed 
with an oligo-dT primer containing a Not I site. Double 
stranded cDNA was ligated to an EcoR I adaptor, digested 
with Not I, and cloned directionally into pT7T3-Pac 
vector. The oligonucleotide used to prime the synthesis of 
first-strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT)18 tail. The 
sequence tag for this library is AGATCATTGC. 
TAG_LIB=UI-H-DH1 
TAG_TISSUE=lung 
TAG_SEQ=AGATCATTGC" 

BASE COUNT 219 a 138 c 166 g 253 t 1 others 

ORIGIN 

Query Match 17.2%; Score 704.2; DB 12; Length 777; 

Best Local Similarity 97.9%; Pred. No. 1.1e-126; 

Matches 734; Conservative 0; Mismatches 14; Indels 2; Gaps 2; 
Qy 1595 TTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAG 1654 

I I II I I I II 1 1 1 1 I II IIIMII II III II 'III II I 

Db 750 TATNTGTGAGCAAGAATTTAAAAATGTTTCCAGTCATGCCTCTGCTGCTGCTGTTACCAG 691 

Qy 1655 TCCAAAAGTCTGATGACCTCGGTCCCCATGAACGGAACAAGCATCCAGTGGAAGAACCAC 1714 

M MMII Ml Ml III MUM II IM II MIMM Ml! II I 

Db 690 TCC^AAGTCTGATGACCTCGGTCCCCATGAACGGAACAAGCATCCAGTGGAAGAACCAC 631 

Qy 1715 GAT CAAAA CAA C CA CAACACAGA C CGGAGCAG C CATAAGGA CAG CATGAACTGACCAC C C 1774 

IMIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 

Db 63 0 GATCAAAACAACCACAACACAGACCGGAGCAGCCATAAGGACAGC^ 571 

Qy 1775 TTAGAAGCA.CTCCTCGGTACTCCC^TAATCCTCTCGGAGAAAAAAATCACAAGGCAACTG 1834 

M IM II Ml Ml III MMM II M II Ml II Ml II II 

Db 570 TTAGAAGCACTCCTCGGTACTCCCATAATCCTCTCGGAGAAAAAAATCACAAGGCAACTG 511 

Qy 1835 TGACTCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAATTCACTCCCACACCCAAGAAGA 1894 

III MIMMMMI IMIMMMM II IIIMII II MM III II M 

Db 510 TGAGTCCGGGAATCTCTTCTCTGATCCTTCTTCCTTAATTCACTCCCACACCCAAGAAGA 4 51 

Qy 1895 AATGCTTTCCAAAACCGCAA-GGTAGACTGGTTTATCC^ 1953 

I Ml MM III II II III II II III II IM III II 

Db 450 AATGCTTTCC^AAACCGC^GGGTAGACTGGTTTATCCACCCAC^ACATCTACGAATCGT 3 91 

Qy 1954 ACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTTGTATTCAGCACTAAAAAATGGT 2 013 

IM II Ml MM II IM II II II II MM MMII MM 

Db 390 ACTTCTTTAATTGATCTAATTTACATATTCTGCGTGTTGTATTCAGCACTAAAAAATGGT 331 

Qy 2 014 GGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCAGAAGGATATTTACTACTTTTGC 2 073 

I II IM IM II II II IM II II Ml II IM IM II 

Db 330 GGGAGCTGGGGGAGAATGAAGACTGTTAAATGAAACCAGAAGGATATTTACTACTTTTGC 2 71 

Qy 2 074 ATGAAAATAGAGCTTTCAAGTACATGGCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAA 2133 

IIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMMIMI 

Db 270 ATGAAAATAGAGCTTTCAAGTACATGGCTAGCTTTTATGGCAGTTCTGGTGAATGTTCAA 211 

Qy 2134 TGGGAACTGGTCACCATGAAACTTTAGAGATTAACGACAAGATTTTCTACTTTTTTTAAG 2193 

MIIIIIIIIIIIIIIIIIIIIIMIIIIMMIIMIIIIIIIIIIMMIIIIIIMI 

Db 210 TGGGAACTGGTCACCATGAAACTTTAGAGATTAACGACAAGATTTTCTACTTTTTTTAAG 151 



Qy 2194 TGA - TTTTTTGTCCTTCAGCCAAACACAATATGGGCTCAGGTCACTTTTATTTGAAATGT 22 52 

III I I! II 'II MM III IMIIIM II I III Mi I 

Db 150 TGATTTTTTTGTCCTTCAGCCAAAC^CAATATGGGCTCAAGTC^CTTTTATTTGAAATC 91 

Qy 2253 CATTTGGTGCCAGTATTTTTTAACTGCATAATAGCCTAACATGATTATTTGAACTTATTT 2312 

I Ml I! II II M II II II II Ml III Ml lllill II 

Db 90 CATTTGGTGCCAGTATTTTTTAACTGCATAATAGCCTAACATGATTATTTGAACTTATTT 31 

Qy 2313 ACACATAGTTTGAAAAAAAAAAGACAAAAA 2342 

IIIIMIIIIIIIIMIIIIIMIIIIII 

Db 30 ACACATAGTTTGAAAAAAAAAAAACAAAAA 1 



RESULT 10 

AU139639 

LOCUS AU139639 764 bp mRNA linear EST 05-AUG-2002 
DEFINITION AU139639 PLACE1 Homo sapiens cDNA clone PLACE1011029 5', mRNA 
sequence. 
ACCESSION AU139639 
VERSION AU139639.1 GI:11001160 
KEYWORDS EST. 
SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 764) 
AUTHORS Ota,T., Nishikawa,T., Suzuki,Y., Ishii,S., Saito,K., Kawai,Y., 
Yamamoto,J., Wakamatsu,A., Nakamura,Y., Nagai,T., Sugano,S. and 
Isogai,T. 
TITLE HRI human cDNA project 
JOURNAL Unpublished 
COMMENT Contact: Takao Isogai 
Genomics Laboratory 
Helix Research Institute 
1532-3 Yana, Kisarazu, Chiba 292-0812, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 
Email: genomics@hri.co.jp 
HRI human cDNA project; 5'- & 3'-end one pass sequencing: Helix 
Research Institute; cDNA library construction: Department of 
Virology, Institute of Medical Science, University of Tokyo, and 
Helix Research Institute. 
FEATURES Location/Qualifiers 
source 1..764 
/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="PLACE1011029" 
/tissue_type="placenta" 
/clone_lib="PLACE1" 
/note="Vector: pME18SFL3" 
BASE COUNT 252 a 134 c 111 g 263 t 4 others 
ORIGIN 

Query Match 17.0%; Score 695.8; DB 9; Length 764; 

Best Local Similarity 97.6%; Pred. No. 4.7e-125; 

Matches 726; Conservative 0; Mismatches 15; Indels 3; Gaps 2; 



Qy 


2322 


Db 


5 


Qy 


2382 


Db 


65 


Qy 


2442 


Db 


125 


Qy 


2502 


Db 


185 


Qy 


2562 


Db 


245 


Qy 


2622 


Db 


304 


Qy 


2682 


Db 


364 


Qy 


2742 


Db 


424 


Qy 


2802 


Db 


484 


Qy 


2862 


Db 


544 


Qy 


2922 


Db 


604 


Qy 


2982 


Db 


662 


Qy 


3042 


Db 


722 



TTGAAAAAAAAAAGACAAAAATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGT 2381 

II M I ,11; M. I II: II II ,1' II' III I : ll III II 

TTTGAAAAAAAAAGACAAAAATAGTATTCAGGTGAGCAATTAGATTAGTATTTTCCACGT 6 4 
CACTATTTATTTTTTTAAAACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAA 2441 

1 1 1 1 IIIIMMIIIIIIMIIIIIIIIIIIIIMIIIIMIIMIMIIIIIIIIIII 

CACTGTTTATTTTTTTAAAACACAAATTCTAAAGCTACAACAAATACTACAGGCCCTTAA 124 



IMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIMI 



184 



ACTGTATTTTATTTTTTAAATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATG 2561 

IM II I: II Ml M II II II M III II. M M MIMM 

ACTGTATTTTATTTTTTAAATGGTGTTTTATTACAAGGGACCTTGAACATGTTTTGTATG 244 
TTAAATTCAAAAGTAATGCTTCAATCAGATAGTTCTTTTTCACAAGTTCAATACTGTTTT 2 621 

ill MM I i Ill 1 1 I 1 1 1 II I : : 1 1 1 1 ^ I I i I III 

TTAAATTCAAAAGTAATGCTTCAATCAGATAGTTCTTTTTCACAAGTTCAAT - CTGTTTT 3 03 
TCATGTAAATTTTGTATGAAAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCAT 2681 

III I J 1 1 1 1 1 1 II I 1 1 III 1 1 IE , 1 1 1 ! i M 

TCATGTAAATTTTGTATGAAAAATCAATGTCAAGTACCAAAATGTTAATGTATGTGTCAT 363 
TTAACTCTGCCTGAGACTTTCAGTGCACTGTATATAGAAGTCTAAAACACACCTAAGAGA 2741 

M II M II Ml I II II II M, III I I III IIMM 

TTAACTCTGCCTGAGACTTTCAGTGCACTGTATATAGAAGTCTACAACACACCTAAGAGA 423 
AAAAGATCGAATTTTTCAGATGATTCGGAAATTTTCATTCAGGTATTTGTAATAGTGACA 2 801 

II IIMM II III I Ml MMIM I III IM MMI Ml I 

AAAAGATCGAATTTTTCAGATGATTCAGAAATTTTCATTCAGGTATTTGTAATAGTGACA 483 
TATATATGTATATACATATCACCTCCTATTCTCTTAATTTTTGTTAAAATGTTAACTGGC 2861 

II II I III MIMI IIMM: II M MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TATATATGTATATACATATCACCTCCTATTCTCTTAATTTTTCTTAAAATGTTAACTGGC 543 
AGTAAGTCTTTTTTGATCATTCCCTTTTCCATATAGGAAACATAATTTTGAAGTGGCCAG 2921 

II II I IIMM II II II II Ml II III II II M IN 

AGTAAGT CTTTTTTGATCATT C CCTTTTCCATATAGGAAACATAATTTTGAAGTGGN CAG 603 
ATGAGTTTATCATGTCAGTGAAAAATAATTACCC^C^AATGCCACCAGTAACTTAACGAT 2 981 

IMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII | | MMI 

ATGAGTTTATCATGTCAGTGAAAAATAATTACCCACAAATGCCACCAG - - AACTTACGAT 661 
TCTTCACTTCTTGGGGTTTTCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCA 3041 

M M I IIMM II M II II II Ml IMMI II II Ml 

TCCTCACTTCTTGGGGTTTTCAGTATGAACCTAACTCCCCACCCCAACATCTCCCTCCCA 721 



1 1 II Mill Mill II 



RESULT 11 
CD246193 

LOCUS CD246193 782 bp mRNA linear EST 22-MAY-2003 
DEFINITION AGENCOURT_14126996 NIH_MGC_145 Homo sapiens cDNA clone 
IMAGE:6912753 5', mRNA sequence. 
ACCESSION CD246193 
VERSION CD246193.1 GI:31006657 
KEYWORDS EST. 
SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 782) 
AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 
TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 
JOURNAL Unpublished 
COMMENT Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: GPCR Consortium 
cDNA Library Preparation: GPCR Consortium 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http://image.llnl.gov 
Plate: IRBI01 row: a column: 08 
High quality sequence stop: 640. 
FEATURES Location/Qualifiers 
source 1..782 
/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="IMAGE:6912753" 
/tissue_type="mixed" 
/lab_host="DH10B" 
/clone_lib="NIH_MGC_145" 
/note="Vector: pcDNA3.1; Site_1: varies by clone; Site_2: 
varies by clone; ORFs were PCR-amplified and cloned into 
pcDNA3.1 by the GPCR Consortium. Cloning sites vary by 
clone and include the following: 5'-EcoRV-XmnI/XhoI-3', 
5'-EcoRV-XmnI/NotI-3', EcoRV (TA cloned, non-directional). 
For information about which gene each clones represents, 
please visit our anonymous ftp site at 
ftp://image.llnl.gov/image/rearrayed_plates/IRBI.preSV.dat 
a Note: this is a NIH_MGC Library." 
BASE COUNT 187 a 194 c 168 g 233 t 
ORIGIN 

Query Match 16.9%; Score 694.8; DB 14; Length 782; 

Best Local Similarity 99.7%; Pred. No. 7.3e-125; 

Matches 696; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy 485 ATGGAAACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 544 

||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 9 ATGGATACCCTTTGCCTCAGGGCATCCTTTTGGCTGGCACTGGTTGGATGTGTAATCAGT 68 

Qy 545 GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 604 

|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 

Db 69 GATAATCCTGAGAGATACAGCACAAATCTAAGCAATCATGTGGATGATTTCACCACTTTT 128 



Qy 605 CGTGGCACAGAGCTC7VGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 664 

M I M II IMIMM MM II MMM MMM II MM 

Db 12 9 CGTGGCACAGAGCTCAGCTTCCTGGTTACCACTCATCAACCCACTAATTTGGTCCTACCC 188 

Qy 665 AGC^TGGCTCAATGCAC^CTATTGCCCACAGC^GACTAAAATTACTTCAGCTTTCAAA 724 

IIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIMMIIIMIMMIII 

Db 18 9 AGCAATGGCTCAATGCACAACTATTGCCCACAGCAGACTAAAATTACTTCAGCTTTCA^ 24 8 

Qy 72 5 TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 784 

M IMI IMIMMII MM II IMIMM Ml II Ml MM 

Db 24 9 TACATTAACACTGTGATATCTTGTACTATTTTCATCGTGGGAATGGTGGGGAATGCAACT 308 

Qy 785 CTGCTCAGGATCATTTACCAGAACAAATGTATGAGGAATGGCCCCAACGCGCTGATAGCC 844 

M MM I! M M I MMMIMI I II I, II M II 

Db 309 CTG CTCAGGATCATTTACCAGAACAAATGTATGAGGAATGG CC CCAACG CGCTGATAGC C 368 

Qy 845 AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 904 

M MM ii M II I MMM M ||; | || |! || j| | 

Db 3 69 AGTCTTGCCCTTGGAGACCTTATCTATGTGGTCATTGATCTCCCTATCAATGTATTTAAG 428 

Qy 905 CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 964 

MMMIMM M II I MMMM MMIMI! MMM II 

Db 42 9 CTGCTGGCTGGGCGCTGGCCTTTTGATCACAATGACTTTGGCGTATTTCTTTGCAAGCTG 4 8 8 

Qy 965 TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 1024 

I MM I II I II I Ml MM IMI M M II M l 

Db 48 9 TTCCCCTTTTTGCAGAAGTCCTCGGTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGT 548 

Qy 102 5 GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 1084 

IMMM I II I MM Ml MM MM MMI II MM 

Db 54 9 GTTGACAGGTACAGAGCAGTTGCCTCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTG 608 

Qy 1085 GTAACTGCCATTGAAATTGTCTCCATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 1144 

IIIIIIMIIIIIIIIIIIIIMI : : . I i I ! : i i 

Db 609 GTAACTGCCATTGAAATTGTCTCCCTCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAA 668 

Qy 114 5 GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGG 1182 

I MM I II I II MM MM 

Db 669 GCGATTGGCTTCGTCATGGTACCCTTTGAATATAGGGG 7 06 



RESULT 12 

AI694278/C 

LOCUS AI694278 739 bp mRNA linear EST 17-DEC-1999 
DEFINITION wd45a01.x1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone 
IMAGE:2331048 3' similar to gb:S57498 ENDOTHELIN-1 RECEPTOR 
PRECURSOR (HUMAN);, mRNA sequence. 
ACCESSION AI694278 
VERSION AI694278.1 GI:4971618 
KEYWORDS EST. 
SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 739) 
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 
Tumor Gene Index 
JOURNAL Unpublished 
COMMENT Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
This clone is available royalty-free through LLNL; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
Insert Length: 1706 Std Error: 0.00 
Seq primer: -40UP from Gibco 
High quality sequence stop: 458. 
FEATURES Location/Qualifiers 
source 1..739 
/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="IMAGE:2331048" 
/lab_host="DH10B" 
/clone_lib="Soares_NFL_T_GBC_S1" 
/note="Organ: pooled; Vector: pT7T3D-Pac (Pharmacia) with 
a modified polylinker; Site_1: Not I; Site_2: Eco RI; 
Equal amounts of plasmid DNA from three normalized 
libraries (fetal lung NbHL19W, testis NHT, and B-cell 
NCI_CGAP_GCB1) were mixed, and ss circles were made in 
vitro. Following HAP purification, this DNA was used as 
tracer in a subtractive hybridization reaction. The driver 
was PCR-amplified cDNAs from pools of 5,000 clones made 
from the same 3 libraries. The pools consisted of 
I.M.A.G.E. clones 297480-302087, 682632-687239, 
726408-728711, and 729096-731399. Subtraction by Bento 
Soares and M. Fatima Bonaldo." 
BASE COUNT 252 a 148 c 120 g 219 t 
ORIGIN 

Query Match 16.9%; Score 694.4; DB 9; Length 739; 

Best Local Similarity 98.2%; Pred. No. 8.8e-125; 

Matches 723; Conservative 0; Mismatches 11; Indels 2; Gaps 2; 



Qy 



Db 



3370 CTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAGGCAGC 3429 

MM MINI 1 1 1 1 MM I II || MTU 'I MM II II 

73 9 CTTCCGCATTCAT - AATCCTGTATTCATGTT - CCCTTCCAAATGGGATATAAGAGGCAGC 682 



Qy 3430 GTGAAAG CAGATGAG CTGTGGACTAG CAATATAGGGTTTTGTTTGGTTGGTTGGTTTGAT 3489 

IMMM I MM II M I M IM Mill III M II IMM 

Db 681 GTGAAAGCAGATGAGCTGTGGACTACAAATATAGGGTTTTGTTTGGTTGGTTGGTTTGAT 622 



Qy 

Db 

Qy 

Db 

Qy 

Db 



34 90 AAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACTTTGAA 354 9 

I Ml I M II M M Ml Ml M II Ml I; II II || 

621 AAAGCAGTATTGGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACTTGAAA 562 
3550 GTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTCTGATA 3609 

MIIIIMIIMIIIMMIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIMIMIMI 

GTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTCTGATA 



561 



502 



3610 



501 



TTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTTGTTGG 3669 

II II IM II MM IMM IM IMM IM IMMIMM IMM 

TTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTTGTTGG 442 



Qy 



3670 G CCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTG CTTGAATTGCAAGG CTAA 3729 



IMM II M I! M II II IMMIMM II MIM 

3GACAGGTAAAATAACAT CAGGTTCCAGTTGCTTGAATTG CAAGGCTAA 382 



Db 


441 


G CCATATTTTAGGACAGGTAAAATAACAT CAGGTTCCAGTTGCTTGAATTG CAAGGCTAA 


382 


Qy 
Db 


3730 
381 


GAAGTACTGCCCTTTTGTGTGTTAGCAGTC^^TCTATTATTCCACTGGCGCATCATATG 

M 1 MMMI II II 1 IM M II Ml M M I Ml II II 

GAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATCATATG 


3789 
322 


Qy 

Db 


3790 
321 


CAGTGATATATGCCTATAATATAAGCCATAGGTTCAC^CCATTTTGTTTAGACAATTGTC 

1 Mi II II MMI'MIMI II MMMI II II II Ml MUM 

CAGTGATATATGCCTATAATATAAGCCATAGGTTCACA.CC^TTTTGTTTAGAC^ATTGTC 


3849 
262 


Qy 

Db 


3850 
261 


TTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTCAGAAA 

II 1 MM III 1 II 1 IM M II Ml MINI III II II 

TTTTTTT(^GATGCTTTGTTTCTTTCATATGAAAAAAATG(^TTTTATAAATTCAGAAA 


3909 
202 


Qy 


3910 


GTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAACTGTGG 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

GTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAACTGTGG 


3969 


Db 


201 


142 


Qy 


3970 


TTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTAACTTT 

IIIMMIIMII .11 1 IM M II iM M II 1 IM II 1 

TTTACTAGCAGGACTATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTAACTTT 


4029 


Db 


141 


82 


Qy 


4030 


GTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAAAATAAA 

IIIIIIIIIIIIIIIIMIMIIIMIIIIIMIMIIIMIIIIIIIIIIIIIIIIIII 

GTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCOVTAAGTGTAAAATAAA 


4089 


Db 


81 


22 


Qy 


4090 


AGTTTACAGAAACCTT 4105 

IIIIIIIIIIMIIII 

AGTTTACAGAAACCTT 6 




Db 


21 





RESULT 13 

BQ575745/C 

LOCUS BQ575745 732 bp mRNA linear EST 19-JUN-2002 
DEFINITION UI-H-EZ1-bbg-h-22-0-UI.s1 NCI_CGAP_Ch2 Homo sapiens cDNA clone 
UI-H-EZ1-bbg-h-22-0-UI 3', mRNA sequence. 
ACCESSION BQ575745 
VERSION BQ575745.1 GI:21479062 
KEYWORDS EST. 
SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 732) 
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 
Tumor Gene Index 
JOURNAL Unpublished 
COMMENT Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Dr. Steven Gitelis/ Rush Presbyterian, Dept. of 
Orthopedics 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Clone distribution information can be obtained 
from Dr. M. Bento Soares, bento-soares@uiowa.edu 
Seq primer: M13 FORWARD 
POLYA=Yes. 
FEATURES Location/Qualifiers 
source 1..732 
/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="UI-H-EZ1-bbg-h-22-0-UI" 
/tissue_type="Chondrosarcoma Grade II" 
/dev_stage="Adult" 
/lab_host="DH10B (Life Technologies)" 
/clone_lib="NCI_CGAP_Ch2" 
/note="Organ: Left Pelvis; Vector: pT7T3-Pac (Pharmacia) 
with a modified polylinker; Site_1: EcoR I; Site_2: Not I; 
NCI_CGAP_Ch2 is a normalized cDNA library containing the 
following tissue(s): Chondrosarcoma Grade II. The library 
was constructed according to Bonaldo, Lennon and Soares, 
Genome Research, 6:791-806, 1996. First strand cDNA 
synthesis was primed with an oligo-dT primer containing a 
Not I site. Double stranded cDNA was ligated to an EcoR I 
adaptor, digested with Not I, and cloned directionally 
into pT7T3-Pac vector. The oligonucleotide used to prime 
the synthesis of first-strand cDNA contains a library tag 
sequence that is located between the Not I site and the 
(dT)18 tail. The sequence tag for this library is 
TGATCACGCT. 
TAG_LIB=UI-H-EZ1 
TAG_TISSUE=grade-2-chondrosarcoma 
TAG_SEQ=ATCTAATATG" 
BASE COUNT 242 a 146 c 110 g 231 t 3 others 
ORIGIN 

Query Match 16.9%; Score 692.2; DB 13; Length 732; 

Best Local Similarity 99.1%; Pred. No. 2.4e-124; 

Matches 694; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 



Qy 


3406 


Db 


718 


Qy 


3466 


Db 


658 


Qy 


3526 


Db 


598 


Qy 


3586 


Db 


538 


Qy 


3646 


Db 


478 



M I MM II III II MM III I III III Ml I 



TTTTGTTTGGTTGGTTGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTG 352 5 

I J 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 M II II 

TTTTGTTTGGTTGGTTGGTNTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTG 599 

GAGOU^GTCATTAC^CTTTGAAGTATTATATTGTTCTTATCCTC^TTC^ATGTGGTG 3585 

II II IUIMI llllllllllllllllllllllllllllllllllllllll 

GAGCAAAAGTCATTAC^CTNTGAAGTATTATATTGTTCTTATCCTC^TTC^TGTGGTG 53 9 

ATGAAATTGCCAGGTTGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAAT 364 5 

1 1 1 M M 1 1 1 1 1 1 1 1 1 III I! 1 1 1 1 1 1 MM II I i 1 1 II ! : i 1 1 1 1 II 1 1 IMIMI 

ATGAAATTG C CAGGTTGTCTGATATTTCTTTCAGACTTCG CCAGACAGATTG CTGATAAT 4 79 

AAATTAGGTAAGATAATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTC 3705 

I III Ml II I II II MM M Ml II! I MM IMM 

AAATTAGGTAAGATAATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTC 419 



Qy 3706 CAGTTGCTTGAATTGCAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCT 3765 

MM 1 1 1 1 1 1 II 1 1 1 II I II 1 1! 1 1 1 1 II I II MM 1 1 1 II III 1 1 1 1 1 1 II 1 1 II 1 1 

Db 418 CAGTTGCTTGAATTGCAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCT 359 

Qy 3766 ATTATTCCACTGGCG CATCATATG CAGTGATATATG CCTATAATATAAG CCATAGGTTCA 3825 

MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I II 1 1 MM III MM 1 1 M I II 1 1 1 1 1 1 1 III 1 1 1 1 M 

Db 358 ATTATTCCACTGG CG CAT CATATG CAGTGATATATG CCTATAATATAAG CCATAGGTTCA 299 

Qy 3826 CACCATTTTGTTTAGACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAA 3885 

IMMI I MM II M Ml II IMM II Ml I III II I 

Db 2 98 CACCATTTTGTTTAGACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAA 239 

Qy 3886 AAATGCATTTTATAAATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTAT 3 945 

IMMI I MM M I! Ml II MMM II III I M: II I 

Db 238 AAATGCATTTTATAAATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTAT 179 

Qy 3 94 6 TTATGGACTGGTAAGTAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTAC 4 005 

IMMI M MM I! M Ml II IMM Ml MM I Ml II I 

Db 178 TTATGGACTGGTAAGTAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTAC 119 

Qy 4 006 TACATCTTTTCAACAAGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGC 4 065 

MMM II II Ml I Ml II II II MIMM MMM IMMI 

Db 118 TACATCTTTTCAACAAGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGC 59 

Qy 4 066 AGTGG CCCATAAGTGTAAAATAAAAGTTTACAGAAACCTT 4105 

M II M II II I II II II II M 

Db 58 AGTGG CCCATAAGTGTAAAATAAAAGTTTACAGAAAC CTT 19 



RESULT 14 

AI809396/C 

LOCUS AI809396 747 bp mRNA linear EST 19-DEC-1999 
DEFINITION wf70b01.x1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone 
IMAGE:2360905 3' similar to gb:S57498 ENDOTHELIN-1 RECEPTOR 
PRECURSOR (HUMAN);, mRNA sequence. 
ACCESSION AI809396 
VERSION AI809396.1 GI:5395962 
KEYWORDS EST. 
SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 747) 
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 
Tumor Gene Index 
JOURNAL Unpublished 
COMMENT Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
This clone is available royalty-free through LLNL; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
Insert Length: 1717 Std Error: 0.00 
Seq primer: -40UP from Gibco 
High quality sequence stop: 466. 
FEATURES Location/Qualifiers 
source 1..747 
/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="IMAGE:2360905" 
/lab_host="DH10B" 
/clone_lib="Soares_NFL_T_GBC_S1" 
/note="Organ: pooled; Vector: pT7T3D-Pac (Pharmacia) with 
a modified polylinker; Site_1: Not I; Site_2: Eco RI; 
Equal amounts of plasmid DNA from three normalized 
libraries (fetal lung NbHL19W, testis NHT, and B-cell 
NCI_CGAP_GCB1) were mixed, and ss circles were made in 
vitro. Following HAP purification, this DNA was used as 
tracer in a subtractive hybridization reaction. The driver 
was PCR-amplified cDNAs from pools of 5,000 clones made 
from the same 3 libraries. The pools consisted of 
I.M.A.G.E. clones 297480-302087, 682632-687239, 
726408-728711, and 729096-731399. Subtraction by Bento 
Soares and M. Fatima Bonaldo." 

BASE COUNT 250 a 154 c 118 g 221 t 4 others 

ORIGIN 

Query Match 16.8%; Score 687.8; DB 9; Length 747; 

Best Local Similarity 98.1%; Pred. No. 1.7e-123; 

Matches 725; Conservative 0; Mismatches 11; Indels 3; Gaps 3; 
Qy 3368 ACCTTCTGCATTCATAAATCTTGTAATCATGTTACCATTACAAATGGGATATAAGAGGCA 3427
        ||||||||||||||| ||||| ||||||||| ||||||    ||||||||||||||||||
Db  742 ACCTTCTGCATTCAT-AATCTGGTAATCATG-TACCATACNNAATGGGATATAAGAGGCA 685

Qy 3428 GCGTGAAAGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTTGG-TTGGTTT 3486
        ||||||| ||||||||||||||||||||||||||||||||||||||||| || ||||| |
Db  684 GCGTGAANGCAGATGAGCTGTGGACTAGCAATATAGGGTTTTGTTTGGTGGGTTTGGTNT 625

Qy 3487 GATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACTTT 3546
        |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db  624 GATAAAGCAGTATTGGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATTACACTTT 565

Qy 3547 GAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTTGTCTG 3606
        |||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||
Db  564 GAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGGTGGTCTG 505

Qy 3607 ATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTTGT 3666
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  504 ATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGATAATTTGT 445

Qy 3667 TGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAGGC 3726
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  444 TGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATTGCAAGGC 385

Qy 3727 TAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATCAT 3786
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  384 TAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGCGCATCAT 325

Qy 3787 ATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACAATT 3846
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  324 ATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTAGACAATT 265

Qy 3847 GTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTCAG 3906
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  264 GTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATAAATTCAG 205

Qy 3907 AAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAACTG 3966
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  204 AAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAAGTAACTG 145

Qy 3967 TGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTAAC 4026
        |||||||||||||||| |||||||||||||||||||||||||||||||||||||||||||
Db  144 TGGTTTACTAGCAGGACTATTTCCAATTTCTACCTTTACTACATCTTTTCAACAAGTAAC 85

Qy 4027 TTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAAAAT 4086
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   84 TTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGTGTAAAAT 25

Qy 4087 AAAAGTTTACAGAAACCTT 4105
        |||||||||||||||||||
Db   24 AAAAGTTTACAGAAACCTT 6



RESULT 15 

BQ006584/C 

LOCUS       BQ006584 705 bp mRNA linear EST 26-MAR-2002
DEFINITION  UI-H-EI1-aza-f-14-0-UI.s1 NCI_CGAP_EI1 Homo sapiens cDNA clone
            IMAGE:5846029 3', mRNA sequence.
ACCESSION   BQ006584
VERSION     BQ006584.1 GI:19731484
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 705)
  AUTHORS   NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
  TITLE     National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
            Tumor Gene Index
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. Jose Mercuende
            cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
            cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
            DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
            Clone Distribution: Clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Seq primer: M13 FORWARD
            POLYA=Yes.
FEATURES    Location/Qualifiers
  source    1..705
            /organism="Homo sapiens"
            /mol_type="mRNA"
            /db_xref="taxon:9606"
            /clone="IMAGE:5846029"
            /tissue_type="Chondrosarcoma"
            /dev_stage="Adult"
            /lab_host="DH10B (Life Technologies)"
            /clone_lib="NCI_CGAP_EI1"
            /note="Organ: Left Pelvis; Vector: pT7T3-Pac (Pharmacia)
            with a modified polylinker; Site_1: EcoR I; Site_2: Not I;
            NCI_CGAP_EI1 is a normalized cDNA library containing the
            following tissue(s): Chondrosarcoma. The library was
            constructed according to Bonaldo, Lennon and Soares,
            Genome Research, 6:791-806, 1996. First strand cDNA
            synthesis was primed with an oligo-dT primer containing a
            Not I site. Double stranded cDNA was ligated to an EcoR I
            adaptor, digested with Not I, and cloned directionally
            into pT7T3-Pac vector. The oligonucleotide used to prime
            the synthesis of first-strand cDNA contains a library tag
            sequence that is located between the Not I site and the
            (dT)18 tail. The sequence tag for this library is
            ACACTTGCAC.
            TAG_LIB=UI-H-EI1
            TAG_TISSUE=chondrosarcoma
            TAG_SEQ=ACACTTGCAC"
BASE COUNT  235 a 143 c 106 g 220 t 1 others
ORIGIN



Query Match 16.7%; Score 685; DB 12; Length 705;
Best Local Similarity 99.9%; Pred. No. 6e-123;
Matches 685; Conservative 0; Mismatches 1; Indels 0; Gaps 0;



Qy
Db  705

Qy 3480 TTGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATT 3539
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  645 TTGGTTTGATAAAGCAGTATTTGGGGTCATATTGTTTCCTGTGCTGGAGCAAAAGTCATT 586

Qy 3540 ACACTTTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGG 3599
        ||||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  585 ACACTNTGAAGTATTATATTGTTCTTATCCTCAATTCAATGTGGTGATGAAATTGCCAGG 526

Qy 3600 TTGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGAT 3659
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  525 TTGTCTGATATTTCTTTCAGACTTCGCCAGACAGATTGCTGATAATAAATTAGGTAAGAT 466

Qy 3660 AATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATT 3719
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  465 AATTTGTTGGGCCATATTTTAGGACAGGTAAAATAACATCAGGTTCCAGTTGCTTGAATT 406

Qy 3720 GCAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGC 3779
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  405 GCAAGGCTAAGAAGTACTGCCCTTTTGTGTGTTAGCAGTCAAATCTATTATTCCACTGGC 346

Qy 3780 GCATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTA 3839
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  345 GCATCATATGCAGTGATATATGCCTATAATATAAGCCATAGGTTCACACCATTTTGTTTA 286

Qy 3840 GACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATA 3899
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  285 GACAATTGTCTTTTTTTCAAGATGCTTTGTTTCTTTCATATGAAAAAAATGCATTTTATA 226

Qy 3900 AATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAA 3959
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  225 AATTCAGAAAGTCATAGATTTCTGAAGGCGTCAACGTGCATTTTATTTATGGACTGGTAA 166

Qy 3960 GTAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAAC 4019
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  165 GTAACTGTGGTTTACTAGCAGGAATATTTCCAATTTCTACCTTTACTACATCTTTTCAAC 106

Qy 4020 AAGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGT 4079
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  105 AAGTAACTTTGTAGAAATGAGCCAGAAGCCAAGGCCCTGAGTTGGCAGTGGCCCATAAGT 46

Qy 4080 GTAAAATAAAAGTTTACAGAAACCTT 4105
        ||||||||||||||||||||||||||
Db   45 GTAAAATAAAAGTTTACAGAAACCTT 20





Search completed: December 12, 2003, 23:51:38 
Job time : 7408.45 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: December 12, 2003, 09:18:36 ; Search time 258.387 Seconds
(without alignments)
7347.063 Million cell updates/sec

Title: US-09-931-157-2
Perfect score: 4301
Sequence: 1 gagacattccggtgggggac ctgggaaaaaaaaaaaaaaa 4301

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 569978 seqs, 220691566 residues

Total number of hits satisfying chosen parameters: 1139956

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : Issued_Patents_NA:*
1: /cgn2_6/ptodata/2/ina/5A_COMB.seq:*
2: /cgn2_6/ptodata/2/ina/5B_COMB.seq:*
3: /cgn2_6/ptodata/2/ina/6A_COMB.seq:*
4: /cgn2_6/ptodata/2/ina/6B_COMB.seq:*
5: /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:*
6: /cgn2_6/ptodata/2/ina/backfiles1.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
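
The percentage fields printed with each result in this listing can be checked against the raw counts. The sketch below is an assumption inferred from the figures in this report, not a documented GenCore formula: Query Match appears to be the hit score divided by the query's perfect score, and Best Local Similarity appears to be matches divided by all aligned columns (matches + mismatches + indels). The function names are hypothetical.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Hit score as a percentage of the query's perfect (self-match) score.

    Assumed relation, inferred from the values printed in this listing.
    """
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches: int, mismatches: int, indels: int) -> float:
    """Matching columns as a percentage of all aligned columns.

    Assumed relation, inferred from the values printed in this listing.
    """
    aligned_columns = matches + mismatches + indels
    return round(100.0 * matches / aligned_columns, 1)


if __name__ == "__main__":
    # Figures for hit US-08-910-864-13 against query US-09-931-157-2
    # (perfect score 4301), as printed in this report.
    print(query_match_pct(1691.8, 4301))
    print(best_local_similarity_pct(1696, 7, 0))
```

With the figures printed for US-08-910-864-13 (Score 1691.8, Matches 1696, Mismatches 7, Indels 0) these relations give 39.3% and 99.6%, which agrees with the values reported for that hit.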
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ALIGNMENTS 



RESULT 1 
US-08-121-446-3 

; Sequence 3, Application US/08121446 
; Patent No. 6313276 
; GENERAL INFORMATION: 

APPLICANT: IMURA, HIROO 

APPLICANT: NAKAO, KAZUWA

APPLICANT: NAKANISHI, SHIGETADA 

TITLE OF INVENTION: A HUMAN ENDOTHELIN RECEPTOR 
NUMBER OF SEQUENCES: 4 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: MORRISON & FOERSTER 

STREET: 755 Page Mill Road 

CITY: Palo Alto

STATE: California

COUNTRY: USA

ZIP: 94304-1018 
COMPUTER READABLE FORM: 



MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/121,446
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/911,684 
FILING DATE: 10-JUL-1992 
ATTORNEY/AGENT INFORMATION: 
NAME: CIOTTI, THOMAS E. 
REGISTRATION NUMBER: 21,013 
REFERENCE/DOCKET NUMBER: 29900-20324.00
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 813-5600 
TELEFAX: (415) 494-0792 
TELEX: 706141 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS:
LENGTH: 4301 base pairs
TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear
FEATURE:

NAME/KEY: CDS
LOCATION: 238..1566
US-08-121-446-3 

Query Match 100.0%; Score 4301; DB 4; Length 4301; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4301; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy  61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy 181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Qy 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360



Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 



ACCGC^GAGATAATGACGCCACCC^CTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 42 0 

I II M II M MM II Ml II I MUM MMI M 

ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 42 0 
CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 4 80 

IIIIIIIIIMIIIIMIIIIMIIIIIIMIIIIIIIIIIIIIIMMIIIIIMIMI 

CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 4 80 
CC^CGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCA^ 54 0 

MMI M M IMI II M MM MMI M M II M II 

CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 54 0 
TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 

MMI I II IMIMI II II M II M M I MM! I! 

TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 
CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 

IIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIMIMIIIIIIIIIIIIII 

CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 
AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 72 0 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIMIIIIIIIIIIIIIMIIIII 

AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720 
CTGCTGG CAGAGGACTGG CCATTTGGAG CTGAGATGTGTAAG CTGGTG CCTTTCATACAG 78 0 

I M MMMI II I MUM IMIIMMM I II I II 

CTGCTGG CAGAGGACTGGCCATTTGGAG CTGAGATGTGTAAG CTGGTG CCTTTCATACAG 78 0 
AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 84 0 

MM II I MMI II IMIMI MMMI MM II MMI 

AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 84 0 
GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900 

' Mi I ; I . ! ; I - ; . ! ; i i : i i i • i 

GCTGTTG CTTCTTGGAGTAGAATTAAAGGAATTGGGGTTC CAAAATGGACAG CAGTAGAA 900 
ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 

I II M Ml Ml II MM III MIMMMII MUM Ml 

ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 
ATAATTACGATGGACTAO^AAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020 

IIMIMMIM MIMI II II III II Ml II I MM MM 

ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020 
AAGAC^GCTTTCATGCAGTTTTACAAGACAGOW^AGATTGGTGGCTGTTCAGTTTCTAT 1080 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAGACAGCTTTCATGCAGTTTTACAAGACAGC^^ 108 0 

TTCTGCTTGCCATTGGCC^TCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 

IIIIIIMIIIIMIIIIIIIIIIIIIIIMIIIIMIMMMIIIIIIMIIIIIIII 

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 
AGAAAGAAAAGTGG CATG CAGATTGCTTTAAATGATCACCTAAAG CAGAGACGGGAAGTG 12 00 

MMMMI II MM I MMMI I Ml II MMIIMIM 

AGAAAGAAAAGTGG CATGCAGATTG CTTTAAATGATCAC CTAAAGCAGAGACGGGAAGTG 1200 



Qy 
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Db 
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1261 
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1261 
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1321 


Db 


1321 
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1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 



GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 

; I ! : ! : I : i i ! I ' ' I I i I ! I 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 
AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 132 0 

MIMIIMIIMMMIIIIIIIIIIIIIMIMIIIIIIIIIIIIMIIIIIIIIMI 

AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320 



MIIIIIMMIIIIIIIIIIIIIIIIMIIIIMIMIIIIIMIIIIIIIIIIIIMI 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 
AACCCAATTGCTCTGTATTTGGTGAGCAA 144 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 
TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

IIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIII 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 
AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 

IMIMIIIIIIIIIIIMMIMIIIIIIIIIMIIMIIMIMIIIIIIIIIIIIII 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 
TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 162 0 

IIIIIIMIIIIIII IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIillMIMM 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 162 0 
AACAAAATGAAACATTTGCCAAAACAAAACAAAAAAC 1680 

I M :! I I I II II M'l Ml I Ml Mill || | | i| 

AACAAAATGAAACATTTGCCAAAACAAAACAAAAAA 1680 
TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 

I I! I M MM II II I M III I III Mill II I I M 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 
TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 

Mill IMIMMM IMMM MM I M IMM IIIIM II II I 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 
TTTTTACAGTTAGC^CTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860 

MMM I II M M MMM I M I Ml II II II II M I I 

TTTTTACAGTTAGC^CTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

I I M I I M M II I I ,1 I III Mill II II II M 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920 
AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 198 0 

II. M IMMMMIMIIMM Mil MM; Mill MMM I M I 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 198 0 
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 

I I MM,M Ml M II I I M; I Ml IMM I M M M 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 
2 041 TATCACACTATTATCAGATTGTAATTAGATG CAAATGAGAGAG CAGTTTAGTTGTTGCAT 2100 



r 



Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 



IMIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIII 

TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100 
TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 

■ i i . I - : ! 1 1 : ! I : I ; ! II I : ; I . 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 
TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 222 0 

II II i 11 I ! I i . 1 1 i i : 1:1: i i I i 1 1 1 1 . i 

TTTTGAAAATC^TTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220 
CAACATGTCACAAACAAGGAXjCATC 2280 

IIIIIIIIIIIIMIIIIIIIIIMIMIIIIMIIIIIIIIIIIIIIIIMMIIIIII 

CAACATGTC^CAAAC^AGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAA 2280 



IIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIM 



CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIMIII 

CAAAGAGAAATAGAATGTTTGAAAGGCTAT CC CAAAAGACTTTTTTGAAT CTGTCATTCA 24 00 
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 24 60 

u iiNiiii ui i M ii ii rni mi i ii ir i 

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 24 60 
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2 520 

MUM M III II IM III II II I MMM I III Mill 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2 520 
CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580 

IIIMIIIIIIIIIMIIIIMIIIMIIIIIIIIIIIIIMIIIIIMIIMIIIIIII 

CTGCATGTAGATGATTAAATGAGGGCAGG CC CTGTG CTCATAG CTTTACGATGGAGAGAT 2580 
GCC^GTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2 64 0 

1 1 .1 1 1 1 1 1 M 11:1 1 l,M I II: i 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGC^GTGTCCACATGACAAAGGG 2 640 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 27 00 

MMIMIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMMM 
GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 ' 

IIIIIIIIMIIIIIIIIIIIIIMIIIMIIMIIIMIIIIIIIIIIIIIIIIIIIII 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 
TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAG CCAATAGAAAAGTTTG CTTG 2 820 

II IMM MMMM I M M MM I II IMMM! Ml II 

TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 282 0 
ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2 880 

I Ml M IMMM IMIMMI IM I IM IMMMM IMM 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 

M MM IMMIMII IMM Ml IM I Ml IMMM I IMM| 



Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Ov 


O O D X 


Db 


3661 


Qy 


3721 


Db 


3721 



2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 294 0 
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3 000 

MM IIIIMIMM IMIIIMIMM lllllllllll IMIII IIIIIIIIM 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3 000 
GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3 060 

■M I M !M; ; MM I : I : M: M 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 
CGTCAC^TCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 3120 

IIIMIIIMIIIIMMIIIIMIIMIIMIIIIIIIIIIIIIIIIIMIIIIIMII 

CGTCACATCAATGCAAAAGGTCCTGATTTTGTTC 3120 
GAGTGACTTTCGAAATAAATTGGG CC CAAGAG CTTTAACT CGGTCTTAAAATATG C CCAA 318 0 

I M IM M II M Ml I IM II II MMIMIMM II I III 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 



II III MM III III I III MM III IIIIIIIIM III I III Mill III 1 1 II II III 



Mill IMIMI III IIIIIIIIIIMIMIM IIIIIIIIM MM II I II 



GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 

I I I I I I I I I I I I I I I II I I II II II II I I II II I I I I I II I I I I I I I I II I I I II II I I I 

GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 

ACAAACTTGTTCTTTAATTTCATCCGUVTCACTTTTTCAGAGGCCTGTTATCATAGAAGT 342 0 

MIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIMIIMIIMII 

ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 342 0 



Mill MM MIMI lllllllllll IIIIMM II lllllllllll MM M 



TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 

I I II M IMM Ml I II M IMIII II I! I II MM 

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 
TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 

II M M II M III I II M II M MIMMM; II I I 

TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 
TGAAACTACACACAAAAAG CATACTTGCATTATTTATAATAAAATTG CATTCAGTGG CTT 3660 

II IMIMI Ml lllllllllll III IMIMI MIMI M II II II Mill 

TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 
TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 372 0 

IIIIIIIIIIIMIIMIIIIIIMIIIIIMIIIIIIIIIIIIIIIIMMIIIIIIII 

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 372 0 
TTCTTTACATACTCAAAACCAAGATAGAAAAAG 3780 

IMIIIMIIMMIIIMIIIIIMIIIIMIIIIIIIIIIIIIIIIIMIIMIIIII 

TTCTTTACATACTCAAAACCAA.GATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 378 0 



Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 


Db 


4021 


Qy 


4081 


Db 


4081 


Qy 


4141 


Db 


4141 


Qy 


4201 


Db 


4201 


Qy 


4261 


Db 


4261 



TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 384 0 

1 1 M I ! 1 1 1 1 1 1 1 1 1 1 1 1 !M 1 1 1 1 1 1 II 1 1! 1 1 M 1 1 1 II 1 1 1 i 1 1 1 II 1 1 1 1 1 II I 

TTCCTAGTATTAAGGACTTTAATATAGC^ACAGACAAAATTATTGTTAACATGGATGTTA 384 0 

CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 

I I :MI M M IM ! I U ' I I 

CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 

GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 

IIIIIIIIIIIMIIMIIIIIMIIMIIIMIIIIMIIIIIMIMIIIIIIIIIII 

GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960 

GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 

1 1 1 1! II 1 1 1 1! 1 1 1 II I II 1 1 1 1 1 1 1 1 INI II 1 1 1 1 1 1 1 1 1 1 1 MM 1 1 1 1 II II II 

GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 

AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 08 0 

I : I Ml.ll 11 1 1 l ! I ; 1 1 i M II : I 

AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 080 



1 1 1 1 - 1 1 ! ; 1 1 ill j : II II ! I : , 1 1 1 Mill II I I 



ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200 

! il II Ml MMIII Ml II: IIMI III III II II! II 

ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200 
CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTC^ATT 4260 
AAAATGCCACATTTCTGGTCTCTGGGAAAAAAAAAAAAAAA 43 01 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 

AAAATGCCACATTTCTGGTCTCTGGGAAAAAAAAAAAAAAA 43 01 



RESULT 2 

US-08-910-864-13 

; Sequence 13, Application US/08910864 

; Patent No. 6280931 

; GENERAL INFORMATION: 

APPLICANT: SAKAMOTO, AIJI 

APPLICANT: HANAOKA, FUMIO 

TITLE OF INVENTION: METHOD FOR SPECIFICALLY AMPLIFYING A cDNA OF AN EXTREMELY
TITLE OF INVENTION: SMALL QUANTITY
NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: FISH & RICHARDSON P.C. 

STREET: 4225 EXECUTIVE SQUARE, SUITE 1400

CITY: LA JOLLA

STATE: CA

COUNTRY: USA

ZIP: 92037
COMPUTER READABLE FORM: 



MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/910,864 
FILING DATE: 13-AUG-1997 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 216506/1996 
FILING DATE: 16-AUG-1996 
ATTORNEY/AGENT INFORMATION:
NAME: HAILE, LISA A. 
REGISTRATION NUMBER: 38,347 
REFERENCE/DOCKET NUMBER: 07898/017001 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 619-678-5070 
TELEFAX: 619-678-5099 
INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS:
LENGTH: 1873 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULE TYPE: cDNA to mRNA
FEATURE:

NAME/KEY: CDS
LOCATION: 231..1556
US-08-910-864-13 

Query Match 39.3%; Score 1691.8; DB 3; Length 1873; 

Best Local Similarity 99.6%; Pred. No. 0; 

Matches 1696; Conservative 0; Mismatches 7; Indels 0; Gaps 0; 
Qy 178 TGAAACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 237
       ||   || ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 171 TGTCTCTAGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 230

Qy 238 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 297
       ||||||||||||||||||||||||||||  ||||||||||||||||||||||||||||||
Db 231 ATGCAGCCGCCTCCAAGTCTGTGCGGACCGGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 290

Qy 298 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 357
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 291 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 350

Qy 358 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 417
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 351 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 410

Qy 418 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 477
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 411 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 470

Qy 478 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 537
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 471 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 530



Qy 


538 


Db 


531 


Qy 


598 


Db 


591 


Qy 


658 


Db 


651 


Qy 


718 


Db 


711 


Qy 


778 


Db 


771 


Qy 


838 


Db 


831 


Qy 


898 


Db 


891 


Qy 


958 


Db 


951 


Qy 


1018 


Db 


1011 


Qy 


1078 


Db 


1071 


Qy 


1138 


Db 


1131 


Qy 


1198 


Db 


1191 


Qy 


1258 


Db 


1251 


Qy 


1318 


Db 


1311 



II M II I IMM II I Ml IMM II I IM IMIIMI I 



ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 657 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

ACACTTCTGAGAATTAT CTACAAGAACAAGTG CATG CGAAACGGTCCCAATAT CTTGATC 650 
GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 717 

MIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIMI 

GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 710 
AAG CTG CTGGCAGAGGACTGG CCATTTGGAG CTGAGATGTGTAAG CTGGTG CCTTTCATA 777 

MIMM MM! M II MMI M II M MIMMM II M I 

AAG CTGCTGGCAGAGGACTGGCCATTTGGAG CTGAGATGTGTAAG CTGGTGCCTTTCATA 770 
CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 837 

1 II II I IM I II I IM I III I II Mill MIMI, II 

CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 83 0 



MMMIIMI III Mill M'MI M II Ml IMM II II I M 



GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 957 

M M I II I II I IM I Ml I Ml II MIMI M II 

GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 950 
GATATAATTACGATGGACTACAAAGGAAGTTATCTG CGAATCTG CTTG CTTCATCCCGTT 1017 

I MMM I MM MMMMMII M: II IMM IMMI III I M 

GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1010 
CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTC 1077 

II II Ml II M II I III I MMM IMM I II M Ml 

CAGAAGACAGCTTTCATGCAGTTTTACA^GAC^GC^AAAGATTGGTGGCTATTCAGTTTC 1070 
TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1137 

M I ilMIIMI Ml I IM I MM I II II II II MMM 

TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1130 
TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1197 

MlMi MMM M M l:MI M II IMM II Ml III M I 

TTGAGAAAGAAAAGTGG CATG CAGATTG CTTTAAATGATCACCTAAAG CAGAGACGGGAA 1190 
GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1257 

Ml I M II M II I IM I MMM II I II II Ml II 

GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1250 
CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1317 

IMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMMIIIIIMIMII 

CTCAG CAGGATTCTGAAG CTCACTCTTTATAAT CAGAATGATCCCAATAGATGTGAACTT 1310 
TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1377 

I MIMI II IMM II Ml II IMIIMI II M II I Ml I 

TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1370 



Qy       1378 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1437
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1371 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1430

Qy       1438 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1497
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1431 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1490

Qy       1498 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1557
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1491 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1550

Qy       1558 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1617
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1551 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1610

Qy       1618 TAAAACAAAATGAAAC^TTTGCCAAAACAAAA^ 1677
Db       1611 TAAAACAAAATGAAACATTTGCCAAAACAAA 1670

Qy       1678 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1737
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1671 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1730

Qy       1738 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1797
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1731 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1790

Qy       1798 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACA^^ 1857
Db       1791 AATTTTTTACAGTTAGCACTTC^CATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1850

Qy       1858
Db       1851



RESULT 3 

US-09-016-434-1203 

; Sequence 1203, Application US/09016434 

; Patent No. 6500938 

; GENERAL INFORMATION: 

APPLICANT: Janice Au-Young

APPLICANT: Jeffrey J. Seilhamer 

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF SIGNALING 
TITLE OF INVENTION: PATHWAY GENE EXPRESSION 
NUMBER OF SEQUENCES: 1490
CORRESPONDENCE ADDRESS: 

ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

STREET: 3174 PORTER DRIVE 

CITY: PALO ALTO 

STATE: CALIFORNIA 

COUNTRY: USA 

ZIP: 94304
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 



OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/016,434

FILING DATE: HEREWITH 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: Zeller, Karen J.

REGISTRATION NUMBER: 37,071

REFERENCE/DOCKET NUMBER: PA-0002 US
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 855-0555 

TELEFAX: (650) 845-4166 
INFORMATION FOR SEQ ID NO: 1203: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1470 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: GENBANK 

CLONE: g182275
US-09-016-434-1203 

Query Match 34.1%; Score 1466.8; DB 4; Length 1470; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1468; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy        192 GAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCC 251
Db

Qy        252 AAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTG 311
Db

Qy        312 GGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAAACCGCAGAGAT 371
Db

Qy        372 AATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGGTC 431
Db

Qy        432 GTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACCAT 491
Db

Qy        492 CTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACAC 551
Db

Qy        552 GGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAAT 611
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 GGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAAT 420

Qy        612 TATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCT 671
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 TATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCT 480

Qy        672 GGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGA 731
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 GGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGA 540

Qy        732 GGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGT 791
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 GGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGT 600

Qy        792 GGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTC 851
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 GGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTC 660

Qy        852 TTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGAT 911
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 TTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGAT 720

Qy        912 TTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGAT 971
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 TTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGAT 780

Qy        972 GGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTT 1031
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 GGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTT 840

Qy       1032 CATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCC 1091
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 CATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCC 900

Qy       1092 ATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAG 1151
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 ATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAG 960

Qy       1152 TGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGT 1211
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 TGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGT 1020

Qy       1212 CTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCT 1271
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 CTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCT 1080

Qy       1272
Db       1081

Qy       1332 GGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGC 1391
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 GGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGC 1200

Qy       1392 TCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTG 1451
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 TCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTG 1260

Qy       1452 CCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGC 1511
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 CCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGC 1320

Qy       1512 TAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAGAA 1571
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1321 TAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAGAA 1380

Qy       1572 GAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATGAA 1631
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1381 GAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATGAA 1440

Qy       1632
Db       1441



RESULT 4 

US-09-175-658B-20 

; Sequence 20, Application US/09175658B 

; Patent No. 6372900 

; GENERAL INFORMATION: 

; APPLICANT: METALLINOS, DANIKA

; APPLICANT: RINE, JASPER 

; APPLICANT: BOWLING, ANN 

; TITLE OF INVENTION: HORSE ENDOTHELIN-B RECEPTOR GENE AND GENE PRODUCTS
; FILE REFERENCE: GOBR-110

; CURRENT APPLICATION NUMBER: US/09/175,658B
; CURRENT FILING DATE: 1998-10-20

; PRIOR APPLICATION NUMBER: 60/062,562
; PRIOR FILING DATE: 1997-10-21
; NUMBER OF SEQ ID NOS: 25
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 20 

LENGTH: 1321 

TYPE: DNA 

ORGANISM: Horse 
US-09-175-658B-20 



Query Match 24.9%; Score 1070.4; DB 4; Length 1321; 

Best Local Similarity 88.7%; Pred. No. 6.1e-243; 

Matches 1171; Conservative 0; Mismatches 146; Indels 3; Gaps 1; 
Qy        227 CAGGTAGCAGCATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTC 286
              ||||||||||||||||||| |  ||||  ||||| |||||||  ||||| |||||| | |
Db          1 CAGGTAGCAGCATGCAGCCTCTGCCAACCCTGTGTGGACGCGTTCTGGTGGCGCTGATCC 60

Qy        287 TTGCCTGCGGCCTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCA 346
              ||||||||||| || |  || ||  |||||| |||||  | |||||||| | ||||||||
Db         61 TTGCCTGCGGCGTGGCAGGGGTCCAGGGAGAAGAGAGGAGATTCCCGCCGGCCAGGGCCA 120

Qy        347 CTCCG---CTTTTGCAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCA 403
              |||||   ||| ||    | | ||||||||||||||| || ||||||||||  ||||| |
Db        121 CTCCGCCACTTCTGGGGTCTGAAGAGATAATGACGCCCCCGACTAAGACCTCCTGGCCGA 180

Qy        404 AGGGTTCCAACGCCAGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACA 463
               ||| |||||||||||  || ||||||| |  |||||| || |  ||||||||| ||  |
Db        181 CGGGGTCCAACGCCAGCGTGCCGCGGTCATCAGCACCTCCGCAAATGCCTAAAGCAGGGA 240

Qy        464
Db        241

Qy        524 TCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGA 583
              |||||||||||||||| ||||||||||| || ||||||||||| |||||||||||||| |
Db        301 TCAAGGAGACTTTCAAGTACATCAACACAGTAGTGTCCTGCCTAGTGTTCGTGCTGGGCA 360

Qy        584
Db        361

Qy        644 CCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCC 703
              | |||||||||||||||||| |||||||  ||||||||||||| ||| |||||||| |||
Db        421 CTAATATCTTGATCGCCAGCCTGGCTCTCCGAGACCTGCTGCAAATCATCATTGACGTCC 480

Qy        704 CTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGC 763
              | ||||||||||||||||||||||| ||||||||||| |||||||  |||||||||||||
Db        481 CCATCAATGTCTACAAGCTGCTGGCTGAGGACTGGCCCTTTGGAGTCGAGATGTGTAAGC 540

Qy        764 TGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGA 823
              ||||||||||||||||||| ||||||||||| ||||||||||||||||| |||||||| |
Db        541 TGGTGCCTTTCATACAGAAGGCCTCCGTGGGCATCACTGTGCTGAGTCTGTGTGCTCTAA 600

Qy        824 GTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAA 883
              |||||||||||||||||||||||||||| | |    |||||||||||||| |||||||||
Db        601 GTATTGACAGATATCGAGCTGTTGCTTCCTTGGAGCGAATTAAAGGAATTCGGGTTCCAA 660

Qy        884 AATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTG 943
              ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db        661 AATGGACAGCAGTAGAAATTGTTTTAATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTG 720

Qy        944 AAGCCATAGGTTTTGATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCT 1003
              ||||| | ||||||||||| |||||    ||||||||||||||||||||||||||||||
Db        721 AAGCCGTGGGTTTTGATATGATTACCGCTGACTACAAAGGAAGTTATCTGCGAATCTGCC 780

Qy       1004
Db        781

Qy       1064 GGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAA 1123
              |||| || |||||||||||||||||||||||||||||||||||||||||||||||  | |
Db        841 GGCTATTTAGTTTCTATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACCTTGA 900

Qy       1124 TGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAA 1183
              ||||||||||||||||||||||||| ||||||||||| |||||||||||||||||| |||
Db        901 TGACCTGTGAAATGTTGAGAAAGAAGAGTGGCATGCAAATTGCTTTAAATGATCACTTAA 960

Qy       1184
Db        961

Qy       1244 GGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCA 1303
              ||||||| |||||||||||||||||| |||| | |||||||||| ||||||||||||||
Db       1021 GGCTTCCTCTTCACCTCAGCAGGATTTTGAAACACACTCTTTATGATCAGAATGATCCCC 1080

Qy       1304 ATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTT 1363
              ||||||||||||||||||||||| |||||||||||||||| ||||| ||||||||||| |
Db       1081 ATAGATGTGAACTTTTGAGCTTTTTGTTGGTATTGGACTACATTGGCATCAACATGGCCT 1140

Qy       1364
Db       1141

Qy       1424
Db       1201

Qy       1484 AAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCA 1543
              | |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 ACAAGCAGTCATGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCA 1320



RESULT 5 

US-09-016-434-1257 

; Sequence 1257, Application US/09016434 
; Patent No. 6500938 
; GENERAL INFORMATION: 

APPLICANT: Janice Au-Young 

APPLICANT: Jeffrey J. Seilhamer 

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF SIGNALING 
TITLE OF INVENTION: PATHWAY GENE EXPRESSION 
NUMBER OF SEQUENCES: 1490
CORRESPONDENCE ADDRESS: 

ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

STREET: 3174 PORTER DRIVE 

CITY: PALO ALTO 

STATE: CALIFORNIA

COUNTRY: USA 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/016,434

FILING DATE: HEREWITH 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: Zeller, Karen J.

REGISTRATION NUMBER: 37,071

REFERENCE/DOCKET NUMBER: PA-0002 US
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 855-0555 

TELEFAX: (650) 845-4166 



INFORMATION FOR SEQ ID NO: 1257: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 4079 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: GENBANK 

CLONE: g219649 
US-09-016-434-1257 

Query Match 9.2%; Score 395.8; DB 4; Length 4079;

Best Local Similarity 65.7%; Pred. No. 1.3e-83; 

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3; 



Qy        505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564
              |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db        685 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 744

Qy        565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624
                | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db        745 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 804

Qy        625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684
              || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db        805 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 864

Qy        685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 740
               |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db        865 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 924

Qy        741 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789
                          ||||| |      | || |||||| | || ||  | |||||  ||||
Db        925 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 984

Qy        790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849
              ||||| ||||| || || |  || || ||||| ||| ||||||| ||  |||| |||||
Db        985 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1044

Qy        850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909
              || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db       1045 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1104

Qy        910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969
              || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db       1105 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1164

Qy        970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029
               | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db       1165 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1218

Qy       1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089
              |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db       1219 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1278

Qy       1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 1146
              || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db       1279 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1338

Qy       1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206
              ||  |  || ||   |||||  | | ||| || || |||||| | || |||||||| |||
Db       1339 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1398

Qy       1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266
              || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db       1399 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1458

Qy       1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326
              ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db       1459 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1518

Qy       1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386
               |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db       1519 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1578

Qy       1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446
              || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db       1579 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1638

Qy       1447 TGGTG 1451
              || ||
Db       1639 TGCTG 1643



RESULT 6 
US-08-121-446-1 

; Sequence 1, Application US/08121446
; Patent No. 6313276 
; GENERAL INFORMATION: 

APPLICANT: IMURA, HIROO 

APPLICANT: NAKAO, KAZUWA 

APPLICANT: NAKANISHI, SHIGETADA

TITLE OF INVENTION: A HUMAN ENDOTHELIN RECEPTOR
NUMBER OF SEQUENCES: 4
CORRESPONDENCE ADDRESS: 

ADDRESSEE: MORRISON & FOERSTER 

STREET: 755 Page Mill Road 

CITY: Palo Alto 

STATE: California 

COUNTRY: USA

ZIP: 94304-1018 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/121,446 

FILING DATE: 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/911,684 

FILING DATE: 10-JUL-1992 
ATTORNEY/AGENT INFORMATION: 



NAME: CIOTTI, THOMAS E. 

REGISTRATION NUMBER: 21,013 

REFERENCE/DOCKET NUMBER: 29900-20324.00 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 813-5600 

TELEFAX: (415) 494-0792 

TELEX: 706141 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 4105 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear
FEATURE:

NAME/KEY: CDS

LOCATION: 485..1768
FEATURE:

NAME/KEY: mat_peptide

LOCATION: 545
US-08-121-446-1 

Query Match 9.2%; Score 395.8; DB 4; Length 4105; 

Best Local Similarity 65.7%; Pred. No. 1.3e-83; 

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3; 



Qy        505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564
              |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db        689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748

Qy        565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624
                | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db        749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808

Qy        625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684
              || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db        809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868

Qy        685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 740
               |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db        869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928

Qy        741 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789
                          ||||| |      | || |||||| | || ||  | |||||  ||||
Db        929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988

Qy        790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849
              ||||| ||||| || || |  || || ||||| ||| ||||||| ||  |||| |||||
Db        989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048

Qy        850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909
              || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db       1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108

Qy        910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969
              || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db       1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168

Qy        970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029
               | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db       1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1222

Qy       1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089
              |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db       1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282

Qy       1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 1146
              || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db       1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342

Qy       1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206
              ||  |  || ||   |||||  | | ||| || || |||||| | || |||||||| |||
Db       1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402

Qy       1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266
              || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db       1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462

Qy       1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326
              ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db       1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522

Qy       1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386
               |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db       1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582

Qy       1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446
              || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db       1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642

Qy       1447 TGGTG 1451
              || ||
Db       1643 TGCTG 1647



RESULT 7 

PCT-US92-02091-1 

Sequence 1, Application PC/TUS9202091 
GENERAL INFORMATION: 

APPLICANT: Battey Jr., James F.
APPLICANT: Corjay, Martha H.
APPLICANT: Feldman, Richard I.
APPLICANT: Harkins, Richard N.

TITLE OF INVENTION: RECEPTORS FOR BOMBESIN-LIKE PEPTIDES
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Edwin P. Ching 
STREET: 1501 Harbor Bay Parkway 
CITY: Alameda 
STATE: CA
COUNTRY: USA
ZIP: 94501 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 



OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02091 

FILING DATE: 19920313 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/426,150 

FILING DATE: 24-OCT-1989 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/533,659 

FILING DATE: 05-JUN-1990 
ATTORNEY/AGENT INFORMATION: 

NAME: Ching, Edwin P. 

REGISTRATION NUMBER: 34,090

REFERENCE/DOCKET NUMBER: A-0092C 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-266-7476 

TELEFAX: 415-266-7400 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1700 base pairs 

TYPE: NUCLEIC ACID 

STRANDEDNESS: double

TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: NO 
ORIGINAL SOURCE: 

ORGANISM: Mus musculus 

CELL LINE: Swiss 3T3 
IMMEDIATE SOURCE: 

LIBRARY: Lambda GT10 
FEATURE:

NAME/KEY: CDS

LOCATION: 378..1532
PCT-US92-02091-1 



Query Match 3.1%; Score 132.2; DB 5; Length 1700; 

Best Local Similarity 56.9%; Pred. No. 1.4e-21; 

Matches 242; Conservative 0; Mismatches 183; Indels 0; Gaps 0; 
Qy        535 TTCAAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAAC 594
              ||||  ||  ||| | | |  || |   | ||| |  ||||| | ||  | || || |||
Db        495 TTCATCTATGTCATCCCTGCAGTTTATGGGCTTATCATCGTGATAGGTCTTATTGGCAAC 554

Qy        595 TCCACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTG 654
Db        555 ATCACGCTCATC^GATCTTCTGC^CGGTC^GTCCATGCGAAACGTGCCAAACCTGTTC 614

Qy        655 ATCGCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTC 714
              ||| | ||| ||||| |||||||||||||||   | || |    |  |||| |  ||| |
Db        615 ATCTCTAGCCTGGCTTTGGGAGACCTGCTGCTGCTGGTGACATGCGCCCCTGTGGATGCC 674

Qy        715 TACAAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTC 774
                ||||   ||||| ||    |||| ||||||          || || ||| | || ||
Db        675 AGCAAGTACCTGGCTGACAGGTGGCTATTTGGCAGAATTGGCTGCAAACTGATCCCCTTT 734

Qy        775 ATACAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGA 834
              |||||     | || |||||  |  ||||  | |  ||    || |||     ||||||
Db        735 ATACAACTTACTTCAGTGGGGGTGTCTGTCTTCACACTTACGGCACTGTCAGCTGACAGG 794

Qy        835 TATCGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCA 894
Db        795 TACAAAGCC^TTGTACGGCCAATGGATATCCAGGCATCCCATGCCCTGATGAAGATCTGT 854

Qy        895 GTAGAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGT 954
               |  ||  || |||||| ||| | |||||| || |  ||||  |||| || ||  |   |
Db        855 CTCAAAGCTGCTTTGATCTGGATTGTCTCTATGTTGTTGGCCATCCCAGAGGCTGTGTTT 914

Qy        955 TTTGA 959
              | |||
Db        915 TCTGA 919



RESULT 8 

US-08-724-394A-20 

Sequence 20, Application US/08724394A 
Patent No. 5872237 
GENERAL INFORMATION:

APPLICANT: Feder, John N.
APPLICANT: Kronmal, Gregory S.
APPLICANT: Lauer, Peter M.
APPLICANT: Ruddy, David A.
APPLICANT: Thomas, Winston
APPLICANT: Tsuchihashi, Zenta
APPLICANT: Wolff, Roger K.

TITLE OF INVENTION: Megabase Transcript Map: Novel
TITLE OF INVENTION: Sequences and Antibodies Thereto 
NUMBER OF SEQUENCES: 31 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: TOWNSEND and TOWNSEND and CREW LLP 

STREET: Two Embarcadero Center, 8th Floor 

CITY: San Francisco 

STATE: CA

COUNTRY: USA

ZIP: 94111-3834
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/724,394A

FILING DATE: 01-OCT-1996 

CLASSIFICATION: 536 
ATTORNEY/AGENT INFORMATION:

NAME: Fitts, Renee A. 

REGISTRATION NUMBER: 35,136 

REFERENCE/DOCKET NUMBER: 017957-000100 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-576-0200

TELEFAX: 415-576-0300
INFORMATION FOR SEQ ID NO: 20: 
SEQUENCE CHARACTERISTICS: 



LENGTH: 246240 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: not relevant
TOPOLOGY: not relevant
MOLECULE TYPE: cDNA
FEATURE:

NAME/KEY: misc_feature
LOCATION: 1..246240

OTHER INFORMATION: /note= " HLA-H . CONTIG" 
US-08-724-394A-20 

Query Match 2.7%; Score 114.6; DB 2; Length 246240; 

Best Local Similarity 82.0%; Pred. No. 1.8e-16; 

Matches 132; Conservative 0; Mismatches 29; Indels 0; Gaps 0; 
Qy          5 CATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCCAGGT 64
              ||| ||   |||| ||||  |||||   |||| ||   ||||  |||||| |||||||||
Db     180691 CATCCCTACGGGGAACTCCAGCCAGTTTGAGCGACACAGATCTGGAGAGCGCTCCCAGGT 180750

Qy         65 AGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGGAGGA 124
              ||||| |||||||||||| ||||||  |||||||||   ||||||||||| |||||||||
Db     180751 AGGCAATTGCCCCGGTGGAACGCCTCACCAGAGCAGCACGTGGCAGGCCCTCGTGGAGGA 180810

Qy        125 TCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGG 165
              ||||| |||||||||||||| ||||||||||||| |||| |
Db     180811 TCAACGCAGTGGCTGAACACCGGGAAGGAACTGGCACTTTG 180851
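
The summary line printed for each hit can be cross-checked against its alignment block. The sketch below is a minimal Python illustration, not part of the search program's output: the function names are hypothetical, and it assumes that Best Local Similarity equals Matches divided by the number of aligned columns (Matches + Conservative + Mismatches + Indels). That assumption reproduces the percentages printed in this listing but is not taken from the program's documentation. It also rebuilds the bar line of an alignment row from its Qy and Db strings.

```python
import re

# Pattern for the per-hit summary line, e.g.
# "Matches 132; Conservative 0; Mismatches 29; Indels 0; Gaps 0;"
SUMMARY = re.compile(
    r"Matches\s+(\d+);\s*Conservative\s+(\d+);\s*"
    r"Mismatches\s+(\d+);\s*Indels\s+(\d+);\s*Gaps\s+(\d+)"
)


def parse_summary(line):
    """Return the five counts from a summary line as a dict of ints."""
    found = SUMMARY.search(line)
    if found is None:
        raise ValueError("not a summary line: %r" % line)
    names = ("matches", "conservative", "mismatches", "indels", "gaps")
    return dict(zip(names, (int(g) for g in found.groups())))


def local_similarity(counts):
    """Percent identity over aligned columns (assumed definition)."""
    columns = (counts["matches"] + counts["conservative"]
               + counts["mismatches"] + counts["indels"])
    return 100.0 * counts["matches"] / columns


def match_line(qy, db):
    """Rebuild the '|' line for two equal-length aligned rows ('-' = gap)."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have equal length")
    return "".join("|" if a == b and a != "-" else " " for a, b in zip(qy, db))


counts = parse_summary(
    "Matches 132; Conservative 0; Mismatches 29; Indels 0; Gaps 0;")
print(round(local_similarity(counts), 1))  # 82.0

bars = match_line("TCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGG",
                  "TCAACGCAGTGGCTGAACACCGGGAAGGAACTGGCACTTTG")
print(bars.count("|"))  # 37
```

Summing the bar counts over every row of a hit should give its Matches value, which is a convenient check when a listing has been through OCR.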



RESULT 9 

US-08-724-394A-21 

Sequence 21, Application US/08724394A 
Patent No. 5872237 
GENERAL INFORMATION: 



APPLICANT: Feder, John N.
APPLICANT: Kronmal, Gregory S.
APPLICANT: Lauer, Peter M.
APPLICANT: Ruddy, David A.
APPLICANT: Thomas, Winston
APPLICANT: Tsuchihashi, Zenta
APPLICANT: Wolff, Roger K.

TITLE OF INVENTION: Megabase Transcript Map: Novel
TITLE OF INVENTION: Sequences and Antibodies Thereto 
NUMBER OF SEQUENCES: 31 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: TOWNSEND and TOWNSEND and CREW LLP

STREET: Two Embarcadero Center, 8th Floor 

CITY: San Francisco 

STATE: CA

COUNTRY: USA

ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.30



CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/724,394A

FILING DATE: 01-OCT-1996

CLASSIFICATION: 536
ATTORNEY/AGENT INFORMATION: 

NAME: Fitts, Renee A. 

REGISTRATION NUMBER: 35,136 

REFERENCE/DOCKET NUMBER: 017957-000100
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-576-0200 

TELEFAX: 415-576-0300 
INFORMATION FOR SEQ ID NO: 21: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 246240 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: not relevant

TOPOLOGY: not relevant 
MOLECULE TYPE: cDNA 
FEATURE:

NAME/KEY: misc_feature

LOCATION: 1..246240

OTHER INFORMATION: /note= "HLA-H . CONTIG" 
US-08-724-394A-21 

Query Match 2.7%; Score 114.6; DB 2; Length 246240; 

Best Local Similarity 82.0%; Pred. No. 1.8e-16; 

Matches 132; Conservative 0; Mismatches 29; Indels 0; Gaps 0; 
Qy          5 CATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCCAGGT 64
              ||| ||   |||| ||||  |||||   |||| ||   ||||  |||||| |||||||||
Db     180691 CATCCCTACGGGGAACTCCAGCCAGTTTGAGCGACACAGATCTGGAGAGCGCTCCCAGGT 180750

Qy         65 AGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGGAGGA 124
              ||||| |||||||||||| ||||||  |||||||||   ||||||||||| |||||||||
Db     180751 AGGCAATTGCCCCGGTGGAACGCCTCACCAGAGCAGCACGTGGCAGGCCCTCGTGGAGGA 180810

Qy        125 TCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGG 165
              ||||| |||||||||||||| ||||||||||||| |||| |
Db     180811 TCAACGCAGTGGCTGAACACCGGGAAGGAACTGGCACTTTG 180851



RESULT 10 
US-08-724-394A-22 

; Sequence 22, Application US/08724394A 
; Patent No. 5872237 
; GENERAL INFORMATION: 

APPLICANT: Feder, John N. 

APPLICANT: Kronmal, Gregory S. 

APPLICANT: Lauer, Peter M. 

APPLICANT: Ruddy, David A. 

APPLICANT: Thomas, Winston 

APPLICANT: Tsuchihashi, Zenta 

APPLICANT: Wolff, Roger K. 

TITLE OF INVENTION: Megabase Transcript Map: Novel
TITLE OF INVENTION: Sequences and Antibodies Thereto 



NUMBER OF SEQUENCES: 31 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: TOWNSEND and TOWNSEND and CREW LLP

STREET: Two Embarcadero Center, 8th Floor 

CITY: San Francisco 

STATE: CA

COUNTRY: USA

ZIP: 94111-3834
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/724,394A

FILING DATE: 01-OCT-1996 

CLASSIFICATION: 536 
ATTORNEY/AGENT INFORMATION: 

NAME: Fitts, Renee A. 

REGISTRATION NUMBER: 35,136 

REFERENCE/DOCKET NUMBER: 017957-000100 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-576-0200 

TELEFAX: 415-576-0300 
INFORMATION FOR SEQ ID NO: 22: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 246240 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: not relevant

TOPOLOGY: not relevant 
MOLECULE TYPE: cDNA 
FEATURE:

NAME/KEY: misc_feature

LOCATION: 1..246240

OTHER INFORMATION: /note= "HLA-H . CONTIG" 
US-08-724-394A-22 

Query Match 2.7%; Score 114.6; DB 2; Length 246240;

Best Local Similarity 82.0%; Pred. No. 1.8e-16;

Matches 132; Conservative 0; Mismatches 29; Indels 0; Gaps 0;
Qy          5 CATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCCAGGT 64
Db     180691 180750

Qy         65 AGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGGAGGA 124
Db     180751 180810

Qy        125
Db     180811



RESULT 11



PCT-US92-02091-5 

; Sequence 5, Application PC/TUS9202091 
; GENERAL INFORMATION: 

APPLICANT: Battey Jr., James F. 

APPLICANT: Corjay, Martha H.

APPLICANT: Feldman, Richard I.

APPLICANT: Harkins, Richard N.

TITLE OF INVENTION: RECEPTORS FOR BOMBESIN-LIKE PEPTIDES
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Edwin P. Ching 

STREET: 1501 Harbor Bay Parkway 

CITY: Alameda 

STATE: CA

COUNTRY: USA

ZIP: 94501 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02091 

FILING DATE: 19920313 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/426,150 

FILING DATE: 24-OCT-1989 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/533,659 

FILING DATE: 05-JUN-1990 
ATTORNEY/AGENT INFORMATION: 

NAME: Ching, Edwin P. 

REGISTRATION NUMBER: 34,090

REFERENCE/DOCKET NUMBER: A-0092C 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-266-7476 

TELEFAX: 415-266-7400 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1584 base pairs 

TYPE: NUCLEIC ACID 

STRANDEDNESS: double

TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: NO 
ORIGINAL SOURCE: 

ORGANISM: Rattus rattus 

TISSUE TYPE: Esophagus 
FEATURE:

NAME/KEY: CDS

LOCATION: 132..1304
PCT-US92-02091-5 



Query Match 2.5%; Score 108.8; DB 5; Length 1584;

Best Local Similarity 54.8%; Pred. No. 4.3e-16;

Matches 215; Conservative 0; Mismatches 177; Indels 0; Gaps 0;



Qy 572 TCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAACAAGTGCA 631 

II I I I I I I I II III II II III II II II I I I I II 

Db 292 TCTCGGTGGGCTTGCTGGGCAACATC^ 351 

Qy 632 TGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTGCACATCG 691 

I I I I I I Mill Mill III I I I Mill MIMIMMMM I 
Db 352 TGCGGAGTGTCCCCAACATCTTCATCTCTAACCTGGCTGCGGGAGACCTGCTGCTGCTGC 411 

Qy 692 TCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTG 751 

I I I MM I III III III II III M II 

Db 412 TGACCTGCGTCCCAGTGGATGCCTCCCGATACTTCTTTGATGAATGGGTGTTCGGCAAGC 471 

Qy 752 AGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTC 811 

I II II II III III III MM Mill I I III I I II 

Db 472 TGGGCTGCAAACTCATCCCAGCCATCCAGCTCACCTCGGTGGGGGTTTCCGTGTTCACTC 531

Qy 812 TATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAA 871 

I II II II llllll II Mill II I II I 

Db 532 TCACGGCCCTCAGCGCTGACAGGTACAGAGCTATCGTGAACCCCATGGACATGCAGACGT 591

Qy 872 TTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTC 931 

Ml II Mill I I M II II 1 1 1 1 II M I M 1 1 I 

Db 592 CTGGTGTGGTGCTGTGGACCAGTTTGAAGGCCGTGGGCATCTGGGTGGTCTCTGTGCTGT 651

Qy 932 TGGCTGTCCCTGAAGCCATAGGTTTTGATATA 963 

MINIMUM II I M II II 

Db 652 TGGCTGTCCCTGAGGCTGTGTTTTCGGAAGTA 683 



RESULT 12 

US-09-175-658B-25/C 

; Sequence 25, Application US/09175658B 

; Patent No. 6372900 

; GENERAL INFORMATION: 

; APPLICANT: METALLINOS, DANIKA

; APPLICANT: RINE, JASPER 

; APPLICANT: BOWLING, ANN 

; TITLE OF INVENTION: HORSE ENDOTHELIN-B RECEPTOR GENE AND GENE PRODUCTS
; FILE REFERENCE: GOBR-110 

; CURRENT APPLICATION NUMBER: US/09/175,658B

; CURRENT FILING DATE: 1998-10-20 

; PRIOR APPLICATION NUMBER: 60/062,562 

; PRIOR FILING DATE: 1997-10-21 

; NUMBER OF SEQ ID NOS: 25

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 25 

LENGTH: 801 

TYPE: DNA

ORGANISM: Horse

FEATURE:

OTHER INFORMATION: Uncertain of the nucleotide sequence at positions 
OTHER INFORMATION: 30, 54, 286, 436, 445, 542, 614, 617, 624, 641, 
OTHER INFORMATION: 731, 746, 753, 770, 775 and 793. 
US-09-175-658B-25 



Query Match 2.5%; Score 107.2; DB 4; Length 801;

Best Local Similarity 86.8%; Pred. No. 7.6e-16;

Matches 118; Conservative 0; Mismatches 18; Indels 0; Gaps 0;
Qy 1408 AAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAA 1467
        ||| || | |     |   |  |||| |||||||||||||||||||| ||||||||||||
Db  223 AAACGAGTTATTTGTTTTGTACAGTCGTGCTTATGCTGCTGGTGCCAATCATTTGAAGAA 164

Qy 1468 AAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATAT 1527
        |||||||||||||| || |||||||| |||||||||||||||||||||||||||||||||
Db  163 AAACAGTCCTTGGAAGACAAGCAGTCATGCTTAAAGTTCAAAGCTAATGATCACGGATAT 104

Qy 1528 GACAACTTCCGTTCCA 1543
        ||||||||||||||||
Db  103 GACAACTTCCGTTCCA 88
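
The summary figures for RESULT 12 (Matches, Mismatches, Best Local Similarity) can be cross-checked against the aligned rows themselves. The following is a minimal sketch, not part of the search software: the function name `alignment_stats` is hypothetical, the two strings are the Qy and Db rows of RESULT 12 with OCR spacing removed, and "Best Local Similarity" is assumed to mean matches divided by aligned length for this gapless alignment.

```python
# Aligned rows of RESULT 12 (Qy 1408-1543, Db 223-88), OCR spaces removed.
QY = (
    "AAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAA"
    "AAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATAT"
    "GACAACTTCCGTTCCA"
)
DB = (
    "AAACGAGTTATTTGTTTTGTACAGTCGTGCTTATGCTGCTGGTGCCAATCATTTGAAGAA"
    "AAACAGTCCTTGGAAGACAAGCAGTCATGCTTAAAGTTCAAAGCTAATGATCACGGATAT"
    "GACAACTTCCGTTCCA"
)


def alignment_stats(qy: str, db: str):
    """Return (matches, mismatches, percent identity, match line).

    Assumes a gapless alignment: both rows have the same length and
    contain residues only.
    """
    if len(qy) != len(db):
        raise ValueError("aligned rows must have equal length")
    # '|' marks an identical column, ' ' marks a mismatch.
    match_line = "".join("|" if a == b else " " for a, b in zip(qy, db))
    matches = match_line.count("|")
    mismatches = len(qy) - matches
    similarity = 100.0 * matches / len(qy)
    return matches, mismatches, similarity, match_line


if __name__ == "__main__":
    m, mm, sim, line = alignment_stats(QY, DB)
    print(f"Matches {m}; Mismatches {mm}; Best Local Similarity {sim:.1f}%")
```

Under these assumptions the counts agree with the reported summary line for RESULT 12, which also gives a way to regenerate a match line where the printed one is illegible.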



RESULT 13 
US-09-120-772-1 

; Sequence 1, Application US/09120772 

; Patent No. 6143521 

; GENERAL INFORMATION: 

APPLICANT: LANE, PAMELA 

APPLICANT: TSUI, PING 

APPLICANT: ELSHOURBAGY, NABIL 

TITLE OF INVENTION: HUMAN BOMBESIN RECEPTOR SUBTYPE 
TITLE OF INVENTION: 3 
NUMBER OF SEQUENCES: 2 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Ratner & Prestia 

STREET: P.O. Box 980 

CITY: Valley Forge 

STATE: PA

COUNTRY: USA

ZIP: 19482 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/120,772

FILING DATE: 22-JUL-1998 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 

NAME: Prestia, Paul F 

REGISTRATION NUMBER: 23,031 

REFERENCE/DOCKET NUMBER: GP-70505 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-407-0700 

TELEFAX: 610-407-0700 

TELEX: 846169 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1205 base pairs 



TYPE: nucleic acid 
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-09-120-772-1 

Query Match 2.5%; Score 106.6; DB 3; Length 1205;

Best Local Similarity 48.5%; Pred. No. 1.3e-15; 

Matches 425; Conservative 0; Mismatches 434; Indels 18; Gaps 4; 
Qy 578 TGGGGAT(^TCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC^AGTGCATGCGAA 637 

1 1 1 1 MM II II I I II I I I I I I Mill III I Mill II 

Db 181 TGGGCATCCTTGGAAATGCTATTCTCATCAAAGTCTTTTTCAAGACCAAATCCATGG^ 24 0 

Qy 638 ACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTG 697 

I III Mill II III Mill Mill I Mill II I I I I I I 
Db 241 CAGTTCCAAATATTTTCATCACCAGCCTGGCTTTTGGAGATCTTTTACTTCTGCTAACTT 300

Qy 698 ACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGT 757 

I M I III I II Mill I MM II III I 

Db 301 GTGTGCCAGTGGATGCAACTCACTACCTTGCAGAAGGATGGCTGTTCGGAAGAATTGGTT 360

Qy 758 GTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTG 817 

Mill II I lllllll II I II II II I I III I I II 

Db 361 GTAAGGTGCTCTCTTTCATCCGGCTCACTTCTGTTGGTGTGTCAGTGTTCACATTAGCAA 420

Qy 818 CTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGG 877

III M MIIIIMI II MM I I I 

Db 421 TTCTCAGCGCTGACAGATACAAGGCAGTTGTGAAGCOVCTTGAGCGACAGCCCTCCAATG 480 

Qy 878 TTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTG 937 

III III II II I I III I II III II I I III 

Db 481 CCATCCTGAAGACTTGTGTAAAAGCTGGCTGCGTCTGGATCGTGTCTATGATATTTGCTC 540

Qy 938 TCCCTGAAGCCATAGGTTTTGATATA ATTACGATGGACTACAAAGGAAGTTATCT 992 

I Mill II III II II II II III I I II I I I I 

Db 541 TACCTGAGGCTATATTTTCAAATGTATACIACTTTTCGAGATCCCAATAAAAATATGACAT 600 

Qy 993 GCGAATCTGCTTGCT - TCATCCCGTTCAGAAGACAGCTTTCATGCAGTTTTACAAGACAG 1051 

Mill I II I MM II MM II III 

Db 601 TTGAATCATGTACCTCTTATCCTGTCTCTAAGAAGCTCTTGCAAGAAATACATTCTCTGC 660 

Qy 1052 CAAAAGATTGGTGGCTGTTC^GTTTCTATTTCTGCTTGCCATTGGCCATCACTGCATTTT 1111 

I lllllll Mil I I I I I I II 

Db 661 TGTGCTTCTTAGTGTTCTACATTATTCCACTCTCTATTATCTCTGTCTACTATTCCTTGA 720

Qy 1112 TTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAA 1171 

II II II I III I III II I I 

Db 721 TTGCTAGGACCCTTTACAAAAGCACCCTGAACATACCTACTGAGGAACAAAGCCATGCCC 780

Qy 1172 ATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCT 1231 

I I II I I II I I I MM Ml II II MM I I I 

Db 781 GTAAGCAGATTGAATCCCGAAAGAGAATTGCCAGAACGGTATTGGTGTTGGTGGCTCTGT 840

Qy 1232 TTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTATAATC 1291

I M I I I II I I I I I I I II MMIM Ml I III I II 

Db 841 TTGCCCTCTGCTGGTTGCCAAATCACCTC CTGTACCTCTACCATTCATTCA 891 



Qy 1292 AGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATATTGGTA 1351

Db 892

Qy 1352 TCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGGTGAGCAAAA 1411

Db 949

Qy 1412 GATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTG 1448

Db 1009




RESULT 14 

US-09-016-434-1275 

; Sequence 1275, Application US/09016434 

; Patent No. 6500938 

; GENERAL INFORMATION: 

APPLICANT: Janice Au-Young 

APPLICANT: Jeffrey J. Seilhamer 

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF SIGNALING 
TITLE OF INVENTION: PATHWAY GENE EXPRESSION 
NUMBER OF SEQUENCES: 1490 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

STREET: 3174 PORTER DRIVE 

CITY: PALO ALTO 

STATE: CALIFORNIA 

COUNTRY: USA

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/016,434 

FILING DATE: HEREWITH 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: Zeller, Karen J. 

REGISTRATION NUMBER: 37,071 

REFERENCE/DOCKET NUMBER: PA-0002 US
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 855-0555 

TELEFAX: (650) 845-4166 
INFORMATION FOR SEQ ID NO: 1275: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1413 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear 



IMMEDIATE SOURCE: 
LIBRARY: GENBANK
CLONE: g291876 
US-09-016-434-1275 

Query Match 2.5%; Score 106.6; DB 4; Length 1413; 

Best Local Similarity 48.5%; Pred. No. 1.4e-15; 

Matches 425; Conservative 0; Mismatches 434; Indels 18; Gaps 4; 



Qy 578 TGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTAC^GAACAAGTGCATGCGAA 637 

1 1 1 1 III I II II II II I I I I II Mill III I Mill II 

Db 328 TGGGCATCCTTGGAAATGCTATTCTC^TCAAAGTCTTTTT(^GACCAAATCCATGCAAA 387 

Qy 638 ACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTG 697 

I III Mill II III Mill Mill I Mill II I I I I I I 

Db 388 CAGTTCCAAATATTTTCATCACCAGCCTGGCTTTTGGAGATCTTTTACTTCTGCTAACTT 447 

Qy 698 ACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGT 757 

I II I III I II Mill I MM II III I 

Db 448 GTGTGCCAGTGGATGCAACTCACTACCTTGCAGAAGGATGGCTGTTCGGAAGAATTGGTT 507

Qy 758 GTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTG 817 

Mill II I I II I II II II I I Ml I I II 

Db 508 GTAAGGTGCTCTCTTTCATCCGGCTCACTTCTGTTGGTGTGTCAGTGTTCACATTAACAA 567 

Qy 818 CTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGG 877 

III II 1 1 1 1 1 1 1 1 1 II MM I I I 

Db 568 TTCTCAGCGCTGACAGATACAAGGCA.GTTGTGAAGCCACTTGAGCGACAGCCCTCCAATG 627 

Qy 878 TTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTG 937 

III Ml II II I I III I II III II I I III 

Db 628 CCATCCTGAAGACTTGTGTAAAAGCTGGCTGCGTCTGGATCGTGTCTATGATATTTGCTC 687 
Qy 938 TCCCTGAAGCCATAGGTTTTGATATA ATTACGATGGACTACAAAGGAAGTTATCT 992 

I Mill II III II II II II III I I II I I I I 

Db 688 TACCTGAGGCTATATTTTCAAATGTATACACTTTTCGAGATCCCAATAAAAATATGACAT 747

Qy 993 GCGAATCTGCTTGCT-TC^TCCCGTTCAGAAGAC^GCTTTC^TGCAGTTTTAC^GACAG 1051 

Mill I II I MM II MM II III 

Db 748 TTGAATCATGTACCTCTTATCCTGTCTCTAAGAAGCTCTTGCAAGAAATACATTCTCTGC 807 

Qy 1052 CAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCACTGCATTTT 1111 

I I I I II I I III I I I I I I I II 

Db 808 TGTGCTTCTTAGTGTTCTACATTATTCCACTCTCTATTATCTCTGTCTACTATTCCTTGA 867 

Qy 1112 TTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAA 1171

M M II I III I III II I I 

Db 868 TTGCTAGGACCCTTTAC^AAAGCACCCTGAACATACCTACTGAGGAACAAAGCCATGCCC 927 

Qy 1172 ATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCT 1231 

I Ml I I II I I I MM III II II MM I I I 

Db 928 GTAAGCAGATTGAATCCCGAAAGAGAATTGCCAGAACGGTATTGGTGTTGGTGGCTCTGT 987 

Qy 1232 TTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTATAATC 1291

I I II I I II M I II I I II MIIMI III I Ml I II 

Db 988 TTGCCCTCTGCTGGTTGCCAAATCACCTC CTGTACCTCTACCATTCATTCA 1038 



Qy 1292 AGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATATTGGTA 1351

I I II II I II I I I I I II I II III I 

Db 1039 CTTCTCAAACCTATGTA GACCCCTCTGCCATGCATTTCATTTTCACCATTTTCTCTC 1095

Qy 1352 TCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGGTGAGCAAAA 1411

llllll Mill III I Mill MIIMI II I I IIIIIMM 

Db 1096 GGGTTTTGGCTTTCAGCAATTCTTGCGTAAACCCCTTTGCTCTCTACTGGCTGAGCAAAA 1155

Qy 1412 GATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTG 1448

Db 1156 GCTTCCAGAAGCATTTTAAAGCTCAGTTGTTCTGTTG 1192




RESULT 15 

US-09-016-434-1215 

; Sequence 1215, Application US/09016434 

; Patent No. 6500938 

; GENERAL INFORMATION: 

APPLICANT: Janice Au-Young

APPLICANT: Jeffrey J. Seilhamer 

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF SIGNALING 
TITLE OF INVENTION: PATHWAY GENE EXPRESSION 
NUMBER OF SEQUENCES: 1490 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

STREET: 3174 PORTER DRIVE 

CITY: PALO ALTO 

STATE: CALIFORNIA

COUNTRY: USA 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/016,434

FILING DATE: HEREWITH 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: Zeller, Karen J. 

REGISTRATION NUMBER: 37,071 

REFERENCE/DOCKET NUMBER: PA-0002 US
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 855-0555 

TELEFAX: (650) 845-4166 
INFORMATION FOR SEQ ID NO: 1215: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1726 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear 
IMMEDIATE SOURCE: 



LIBRARY: GENBANK 
CLONE: g183649
US-09-016-434-1215 



Query Match 2.5%; Score 106; DB 4; Length 1726; 

Best Local Similarity 52.0%; Pred. No. 2.1e-15; 

Matches 238; Conservative 0; Mismatches 220; Indels 0; Gaps 0; 
Qy 536 TCAAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACT 595 

II II III I I I III I II I I II I II 1 1 1 1 II Ml 

Db 514 TCCTCTATGTCATCCCTGCAGTTTATGGGGTTATCATTCTGATAGGCCTCATTGGCAACA 573 

Qy 596 C(^CACTTCTGAGAATTATCTA(^GAAC^GTGCATGCGAAACGGTCCCAATATCTTGA 655 

III I I I II III I Mill lllllllllll III II I II I 

Db 574 TCACTTTGATCAAGATCTTCTGTACAGTCAAGTCCATGCGAAACGTTCCAAACCTGTTCA 633 

Qy 656 TCGCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCT 715 

I 1 1 1 1 Mill MM I I I I I II I III I 

Db 634 TTTCCAGTCTGGCTTTGGGAGACCTGCTCCTCCTAATAACGTGTGCTCCAGTGGATGCCA 693 

Qy 716 ACAAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCA 775 

II I Mill II MM MUM II II III I II II I 

Db 694 GCAGGTACCTGGCTGACAGATGGCTATTTGGCAGGATTGGCTGCAAACTGATCCCCTTTA 753 

Qy 776 TACAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGAT 835

Mill MM II II I MM II II III I III II I 

Db 754 TACAGCTTACCTCTGTTGGGGTGTCTGTCTTCACACTCACGGCGCTCTCGGCAGACAGAT 813 

Qy 836 ATCGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAG 895

I III III I II I I III II 

Db 814 ACAAAGCCATTGTCCGGCC^ATGGATATCCAGGCCTCCCATGCCCTGATGAAGATCTGCC 873 

Qy 896 TAGAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTT 955 

III III II III I MM II I Mill I II II III I II 

Db 874 TCAAAGCCGCCTTTATCTGGATCATCTCCATGCTGCTGGCCATTCCAGAGGCCGTGTTTT 933 

Qy 956 TTGATATAATTACGATGGACTACAAAGGAAGTTATCTG 993 

III I I I I I I II I I III 

Db 934 CTGACCTCCATCCCTTCCATGAGGAAAGCACCAACCAG 971 
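
The reported scores for these hits can be reproduced from their Matches, Mismatches, Indels and Gaps counts. The sketch below is an illustration only: the gap penalties come from the report header ("Gapop 10.0 , Gapext 1.0"), but the mismatch penalty of 0.6 is an assumption inferred from the printed numbers, since the report does not list the values of the scoring table. The perfect score of 4301 is likewise assumed to be the one that applies to this search, and the names `reconstructed_score` and `REPORTED` are hypothetical.

```python
GAP_OPEN = 10.0      # "Gapop 10.0" in the report header
GAP_EXT = 1.0        # "Gapext 1.0" in the report header
MISMATCH = 0.6       # ASSUMPTION: inferred from the scores, not stated in the report
PERFECT_SCORE = 4301 # ASSUMPTION: perfect score of the query used in this search


def reconstructed_score(matches, mismatches, indels, gaps):
    """Score under the assumed scheme: +1 per match, -0.6 per mismatch,
    -10 per gap opened and -1 per gapped (indel) position."""
    return matches - MISMATCH * mismatches - GAP_OPEN * gaps - GAP_EXT * indels


# (matches, mismatches, indels, gaps, reported score) copied from the summaries.
REPORTED = {
    "RESULT 11": (215, 177, 0, 0, 108.8),
    "RESULT 12": (118, 18, 0, 0, 107.2),
    "RESULT 13": (425, 434, 18, 4, 106.6),
    "RESULT 15": (238, 220, 0, 0, 106.0),
}

if __name__ == "__main__":
    for name, (m, mm, ind, gaps, reported) in REPORTED.items():
        score = reconstructed_score(m, mm, ind, gaps)
        query_match = 100.0 * score / PERFECT_SCORE
        print(f"{name}: score {score:.1f} (reported {reported}), "
              f"query match {query_match:.1f}%")
```

If the assumed penalties are right, "Query Match" is simply the score as a percentage of the perfect score, which is why several hits with different lengths and identities all print as 2.5%.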



Search completed: December 13, 2003, 00:00:18 
Job time : 264.387 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on: December 12, 2003, 10:54:32 ; Search time 1259.7 Seconds
(without alignments)
11347.756 Million cell updates/sec

Title: US-09-931-157-2

Perfect score: 4301

Sequence: 1 gagacattccggtgggggac ctgggaaaaaaaaaaaaaaa 4301

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 2201672 seqs, 1661799599 residues

Total number of hits satisfying chosen parameters: 4403344

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries 

Database : Published_Applications_NA:*

1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq2:*
14: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
15: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
16: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
17: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
18: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
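
The header figures of this search are related by simple arithmetic. The sketch below is an interpretation rather than documented behaviour of the search program: it assumes that each database sequence is compared on both strands (so hits and cell updates are doubled), and that one "cell update" is one query-residue by database-residue comparison. The function names are hypothetical; the constants are copied from the header.

```python
QUERY_LENGTH = 4301          # "Perfect score" / query length for US-09-931-157-2
DB_SEQUENCES = 2201672       # "Searched: 2201672 seqs"
DB_RESIDUES = 1661799599     # "... 1661799599 residues"
SEARCH_SECONDS = 1259.7      # "Search time 1259.7 Seconds"
STRANDS = 2                  # ASSUMPTION: forward and complementary strand both searched


def total_hits() -> int:
    """Number of candidate hits if every sequence is scored once per strand."""
    return DB_SEQUENCES * STRANDS


def million_cell_updates_per_sec() -> float:
    """Dynamic-programming cells filled per second, in millions."""
    cells = QUERY_LENGTH * DB_RESIDUES * STRANDS
    return cells / SEARCH_SECONDS / 1e6


if __name__ == "__main__":
    print("hits:", total_hits())
    print("Million cell updates/sec: %.3f" % million_cell_updates_per_sec())
```

Under these assumptions the hit count equals the reported 4403344 exactly, and the throughput comes out close to the reported 11347.756; the small difference is consistent with the search time being printed to only one decimal place.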
ALIGNMENTS 



RESULT 1 
US-09-931-157-2 

; Sequence 2, Application US/09931157 

; Patent No. US20020082414A1 

; GENERAL INFORMATION: 

; APPLICANT: Imura, Hiroo 



APPLICANT: Nakao, Kazuwa 
APPLICANT: Nakanishi, Shigetada 
TITLE OF INVENTION: Human Endothelin Receptor 
FILE REFERENCE: 299002032411 
CURRENT APPLICATION NUMBER: US/09/931,157
CURRENT FILING DATE: 2001-10-15 
PRIOR APPLICATION NUMBER: 08/121,446 
PRIOR FILING DATE: 1993-09-14 
PRIOR APPLICATION NUMBER: 07/911,684 
PRIOR FILING DATE: 1992-07-10 
PRIOR APPLICATION NUMBER: JP 3-172828 
PRIOR FILING DATE: 1991-07-12 
NUMBER OF SEQ ID NOS: 2

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 2 
LENGTH: 4301 
TYPE: DNA

ORGANISM: Homo Sapiens 
FEATURE:
NAME/KEY: CDS
LOCATION: (238)...(1566)
US-09-931-157-2 

Query Match 100.0%; Score 4301; DB 9; Length 4301;

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4301; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy    1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy   61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy  121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy  181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Qy  241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy  301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy  361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

Qy  421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480



Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 



CCACGCACCATCTCCCCTCCCCCX3TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540 

I III I MM MM IMI MM IMMIM IMMI 

CCACGCACCATCTCCCCTCCCCCGTGCC^GGACCa\TCGAGATCAAGGAGACTTTCAAA 54 0 
TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIMIIIIIIIII 

TAC^TC^ACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 
CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 

IIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIII 

CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 



IIIIIIIIIMIMIIIMIIIIIIIIIMIIIIIIMIMIMIIIIIIIIIMIII 



CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 78 0 

IMIIMIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIMIIIII 

CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780 

AAAG CCTCCGTGGGAATCACTGTG CTGAGTCTATGTG CTCTGAGTATTGACAGATAT CGA 84 0 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMI 

AAAG CCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT CGA 84 0 



MIIIIIIMIIIIIMIIIIIIIIIIMIIIMIIIMIIIIIIIIMIIMI 



IIMIIMIIIIMIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMMII 



ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020 



MMMMMMMMMMMMMMMMMMMMMMMMMMMMM 



TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 

AGAAAGAAAAGTGG CATG CAGATTG CTTTAAATGAT CACCTAAAGCAGAGACGGGAAGTG 12 00 

IMMMMM MM MMMMMMMMMMMMM MMMMMMMMMI 

AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 12 00 



M I M I M 1 1 1 1 1 II 1 1 1 1 1 M I II 1 1 II I II M 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 M 

[CCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 
.GCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 

Mi M, I Ml I 1 1 1 1 1 ' : I , 1 1 1 1 1 i mm M ; 



Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 



AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 

M II I Ml II III I IM MM II II II M I I I! I 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 
AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 

M II I MMM IM I II MM II M II IM I I III 

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 
TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

M II I Ml II II MMI MM II M II II I I I! I 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 



I II I I II II M MM I Ml I MIMM M MMI 



TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 162 0 

I Ml M I MMI II Mill Mill I MIIIMMMIMM 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 162 0 
AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTAT^ 1680 

MMM M II MMI I III I II MMM II II M Ml II 

AACAAAATGAAAGAlTTTGCCAAAACAA 168 0 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 

I II I M III MM III MMI II I MMM I II MM 

TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 174 0 
TTACGGCATGGAAAGAAAAT CAGTGGGAATTAAGAAAGC CT CGTCGTGAAAGCACTTAAT 18 00 

I Ml II I Ml MM III M II II I MMM Mill II M 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 180 0 
TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860 

IIIIMIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

TTTTTAC^GTTAGC^CTTC^CATAGCTCTTAAC^CTTCCAGGATATTCACACAACACT 1860 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920 

I M I II III II MMI Mill MMI MMM I M Mill 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 
AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 198 0 

IMIMM I II MM III I II II Mill II Mill II I 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 204 0 

I M M MIMMI li II I II IMM IIMIMI MMI 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 
TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100 

IIIIIIIIIIIIMIIIIIIIMIIIIIIIIIMIIMIIIIMMIIIIIMMIIIII 

TATCACACTATTAT CAGATTGTAATTAGATG CAAATGAGAGAG CAGTTTAGTTGTTGCAT 2100 
TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAG CAAGG CTGT 2160 

MIIIMIIIIIIMIMIIIIIIIIIIIMIIMIIIIIMIIIIIIIIIIIMIIIM 

TTTTCGGACACTGGAAACATTTAAATGATG^GGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 



Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 



TTTTGAAAATCATTACACTTTCACTAGAAGCCCA^CCTC^GCATTCTGCAATATGTAAC 2220 

IIIMIIIIIIIIIIIIIIMIMMIIIIMIIIIIIIIIIIMIIIIIIMIIIIIII 

TTTTGAAAATCATTACACTTTC^CTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220 
CAACATGTCACAAACAAGCAGCATC 2280 

Mill MMMMMMMMIMIIMM MMMIIMM MMIMIMM 

CAAC^TGTCACAAAC^GCAGCATGTAACAGACTGGCAC^TGTGCCAGCTGAATTTAA^ 2280 
TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 

! Ml J : 1 l ! M I I nil ■ 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTC^GTTAAGATC^AACCTCA 234 0 
CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

MIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMMIIIIIIIIIIII 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCZATTCA 24 00 
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 24 60 

I! I III III I III! MM M I IIIIMII Ml MM MM MM Ml MM MM II I 

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 24 60 
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520 

IIIIIIIIIIMIIIIIMIIIIIIIIMMMMIIIMIIIIIMIMIIIIIIMII 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520 
CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTG CTCATAG CTTTACGATGGAGAGAT 258 0 

IIIIIMIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIMIIIIIIIIIIIIIIIII 

CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580 
GCC^GTGACCTC^TAATAAAGACTGTGAACTGCCTGGTGC^GTGTCCACATGACAAAGGG 264 0 

1 1 1 1 1 III I II II II 1 1 1 1 1 II II 1 1 II II M 1 1! 1 1 II 1 1 1 1 1 1 II II II 1 1 1 1 1 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 264 0 
GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

M : IM ! M I M M M I I , . MIM >■ 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 
GCTATAGTTAAAATACTATTTTTC^AAATC^TAC^GATTAGTAC^TTTAACAGCTACCTG 2760 

II I Ml II II I II II II II I M 1 1 II I II III II I II I II II IMIM II II I II I 1 1 1 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 
TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTG CTTG 2820 

IMIIIIMIIMIIMIIMIMIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIII 

TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820 
ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 

IIIIIIMMIMIiMIIMMIIMIIIIIMIIIIMIIIIIIIIMIIIIIMIII 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 294 0 

1 1 1 1 1 1 1 1 r 1 1 1 1 1 i i i 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f ) 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 294 0 
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

M M , M M ■ I ' , ! MM i ! ; 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3 000 
3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3 060 



Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 



M II Ml Ml II Ml I II II II Ml M' III M IMMI I 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3 060 
CGTC^C^TC^TGC^^GGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCT^ 312 0 

IIIIIIIIIIIIIIMIIMIIIIIIMIIMIIIIIMIIIIIIIIIIIIMIIIMII 

CGTC^C^TCAATGC^W^GGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 312 0 
GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 318 0 

IIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIMIIMIIIIII 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 318 0 
ATTTTTACTTTGTTTTTCTTTTAATAGG CTGGG CCACATGTTGGAAATAAG CTAGTAATG 324 0 

MM II II II II M MM II II II MMM M M II I 

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 
TTGTTTTCTGTCAATATTGAATGTGAT^ 3 3 0 0 

I MMM II M 'Ml M II :MMI IM I Ml M Ml IMM 

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300 
GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 

IMMI M MIMMMI I II II I MM M III M IMMI . 

GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 
AC^W^CTTGTTCTTTAATTTCATCCC^TCACTTTTTCAGAGGCCTGTTATCATAGAAGT 342 0 

IMIIMIIIIIIIIIIIIIIIIMMIIMIIIIIIIIIMIIIIMIIIIIIMIIII 

AC^U^CTTGTTCTTTAATTTCATCCC^lATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 342 0 
CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480 

I MMM II MMM I II II M MM I Ml MMM IM'i 

CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480 
TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 

IMMI I MM M II M MM II M MMM IM MMMMI 

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 



I i II ! : 1 1 1 I . II I il 1 1 1 ■ 1 1 flii I 1 1 i i .' 1 1 1 : 1 1 I 



TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 

I M M IM I M II II II I II I IM MM, II 

TGAAACTAC^CAC^Wy^GCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 
TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 372 0 

I II Mi II I Ml II II II. I Ml II II II M ; II I 

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 372 0 



'IM I M M IM'M'-M M I Ml MMM IM I MM 

TCTTTAC^TACTC^W^CCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAAC^TG r 
TCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTi 

M M ! IM I MM M M' II II M MM! MM I MM 

TCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTi 
!AGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAA r 

MMM I MM M II II I IMMI IM IM MMM I MM 



Db 



3841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 



Qy 3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 

ll' II 'I II II : 

Db 3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 

Qy 3 961 GTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4 020 

MIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 

Db 3 961 GTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4 020 

Qy 4021 AAAAAATTATATATC^^ 4080 

Db 4 021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 080 

Qy 4 081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 

I II I Ml Ml i IMMI M II Mi II M II MM i 

Db 4 081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 

Qy 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 

I II II III II I Ml II MMI IMMI II IM M M 

Db 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 

Qy . 42 01 CAAGTGGACATTATTTATGTTAAATATACAATTAT 4260 

IIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMMIIIII 

Db 42 01 CAAGTGGACATTATTTATGTTAAATATAC^TTATCAAGCAAGTATGAAGTTATTCAATT 4260 

Qy 42 61 AAAATGCCACATTTCTGGTCTCTGGGAAAAAAAAAAAAAAA 43 01 

IIIIIIIIMIIIIIIIIIIIMIIIIIMIIIIIIIIIII 

Db 4261 AAAATGCCACATTTCTGGTCTCTGGGAAAAAAAAAAAAAAA 43 01 



RESULT 2 

US-10-007-926A-177 

Sequence 177, Application US/10007926A 
Publication No. US20030143539A1 
GENERAL INFORMATION: 
APPLICANT: BERTUCCI, FRANCOIS
APPLICANT: HOULGATTE, REMI 
APPLICANT: BIRNBAUM, DANIEL 
APPLICANT: NGUYEN, CATHERINE 
APPLICANT: VIENS, PATRICE
APPLICANT: FERT, VINCENT 

TITLE OF INVENTION: GENE EXPRESSION PROFILING OF PRIMARY BREAST CARCINOMAS 
TITLE OF INVENTION: USING ARRAYS OF CANDIDATE GENES 
FILE REFERENCE: 1546-R-00 

CURRENT APPLICATION NUMBER: US/10/007,926A
CURRENT FILING DATE: 2001-12-07 
PRIOR APPLICATION NUMBER: 60/254,090 
PRIOR FILING DATE: 2000-12-08 
NUMBER OF SEQ ID NOS: 468
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 177 
LENGTH: 4286 
TYPE: DNA

ORGANISM: Homo sapiens
FEATURE:

OTHER INFORMATION: endothelin receptor type b (EDNRB) gene. 



US-10-007-926A-177 



Query Match 99.6%; Score 4284.4; DB 13; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy         61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy        121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy        181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db        181 AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Qy        241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy        301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy        361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

Qy        421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

Qy        481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

Qy        541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

Qy        601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

Qy        661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

Qy        721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780



Qy        781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

Qy        841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

Qy        901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960

Qy        961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020

Qy       1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080

Qy       1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140

Qy       1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200

Qy       1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260

Qy       1261 AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320

Qy       1321 AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1321 AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380

Qy       1381 AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1381 AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440

Qy       1441 TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1441 TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500

Qy       1501 AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1501 AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560

Qy       1561 TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1561 TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620



Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Ov 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 



AACAAAATGAAACATTTGCO^AAACAAAACAAAAAACTATGTATTTGCACAGaiCACTAT 1680 

I II MMM II IMM II M MM II II II I III Mill 

AACAAAATGAAACATTTGCC^ 1680 
TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 

M II II Ml II II M II Ml II II II M I IM MMM 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 
TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAG C CTCGTCGTGAAAGCACTTAAT 1800 

1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 1 II 1 1 1 1 1 1! I! 1 1 1 1 1 1 1 1 1| 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAG C CT CGTCGTGAAAGCACTTAAT 18 00 
TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860 

I I II Ml II II I II Ml II lllilll llllll IIMM II 

TTTTTACAGTTAGCACTTC^CATAGCTCTTAACAACTTCC^GGATATTCACAC^ACACT 1860 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

II I III INI lllilll MM 1 1.1 MM III INI Mill llllll III INI Mill 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 
AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 

IIIIIIIIIIMIIMIIIIIIIIIIIIIMIIIIIIIMIMIIMMIIIIIIIIIII 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 

IMIMIM IMIIMM MMM I MM IM I III I MM Mill 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 
TAT CACACTATTATCAGATTGTAATTAGATG CAAATGAGAGAGCAGTTTAGTTGTTG CAT 2100 

IIIIIIIIIIIIIMIIIIIIIIMIIIIMIIIIIIIMIIIIIIIIMIIIIIIIIII 

TATCACACTATTATCAGATTGTAATTAGATG CAAATGAGAGAGCAGTTTAGTTGTTG CAT 2100 
TTTTCGGAC^CTGGAAAC^TTTAAATGATC^GGAGGGAGTAAC^GAAAGAGCAAGGCTGT 2160 

IMIMIMIMIIIIIIIIIIIIIillllllMIIIMIIIIMIIMIIIIIIIIIII 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 
TTTTGAAAATCATTACACTTTC^CTAGAAGCCCAAACCTCAGC^TTCTGCAATATGTAAC 222 0 

1 1 1 1 M II 1 1 1 1 1 1 II 1 1 II 1 1 1 II II II II II 1 1 II II II I II 1 1 II I II 1 1 1 1 1 1 1 1 1 

TTTTGAAAAT(^TTACACTTTC^CTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220 
CAACATGTCACAAACAAGCAGCATGTAAC^ 228 0 

1 1 1 M I II II II II II I II I II II II 1 1 II I II 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II I II 1 1 

CAACATGTCACAAACAAGCAGCATGTAACAGAC 2280 
TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 

IIIIIIIMIIIIIIIIIMIIIIIIMMIIIIIIIMIMIIIIIMIIIMIIMM 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 III lllilll MM MMMIMIMM MM III M II II 



CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 24 60 

I 'IMIIMM I Mill MM MMM IM I Ml MMM IM 

CA.TACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 24 60 



Qy 



24 61 TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520 



Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 



II I II Ml II II MMI I II Ml I IM I II MMI 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520 
CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580 

M I II M II M I II I II Ml I II I II I Ml 

CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 258 0 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2 64 0 

IIIIIIIMIMIIIIIMIMIMIIMIIIIIIIIMIIIIIIIIMMIIIMIIII 
GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 264 0 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

I I III II II M I II I II MM I II II II; I Ml 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 
GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 

IIIIIIMIIIIIIIIIMIIIIIIIIIIIMIIMIMIIMIIIIIMIIIIIIMII 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAG CTAC CTG 276 0 
TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTG CTTG 282 0 

I M IM II II II I II I M II M MI M II M II 

TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAG CCAATAGAAAAGTTTG CTTG 282 0 
ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 288 0 

I M IM II IM II M II I M II II Ml M M M II 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 28 8 0 
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 

I I IM II II II I II I M' II M II I! IM I M 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

1 1 1 e 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 i 1 1 1 1 1 1 1 1 1 1 r i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r e 1 1 1 1 1 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 
GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3 060 

I I M M II MM MMM M II Ml I IM I II MMI 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3 060 
CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAAC^ 312 0 

IIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 

CGTCACATC^TGCAAAAGGTCCTGATTTTGTTCCAGCAAAACA(^GTGCAATGTTCTCA 312 0 
GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 318 0 

M I II MMM II I II II III M I ll I Ml i I 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 
ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 

IIIIIIIMIIMIIIIIIIIIIMIIIIIIIIIIMIIIIIIMMIIIIIIIIIIIII 

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 
TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300 

I M M, M II MM M:M II II Ml I IM I Ml M II 

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCIAAAACCCAACAATGTGGCCA 3300 



I II M, II I MM I MMMI iMil I II M Ml Ml I 



Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 


Db 


4021 


Ov 

wy 


A HQ 1 


Db 


4081 


Qy 


4141 


Db 


4141 



3301 GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 

AC^AACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 342 0 

I I' II I II ! Ml II lllill :l II I' I! !l II I I 

ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 342 0 

CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 34 80 

I 1 1 ' 1 1 1 . 1 II .1 II , 1 1 1 1 1 I 1 1 l , 1 1 

CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 34 80 
TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 

II Ml M Ml M Mill II IMMIMIM ill MMM I I 

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 
TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 
TGAAACTACACACAAAAAG CATACTTGCATTATTTATAATAAAATTG CATTCAGTGG CTT 3660 

M MMI MMM! II II II II I MM MM MIMIIMI 

TGAAACTACACACAAJ^GCATACTTG 3660 
TTTAAAAAAAATGTTTGATTCAAAACTTTAACATA 3 72 0 

I I MMI III MMI II Mill! I II IMIMI II MMI 

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720 
TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 378 0 

MIIIIIIIIIIIMIIII MMIIMIMIIMIMIIIIMMMIIIIMMII III 

TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTT^ 378 0 

TT CCTAGTATTAAGGACTTTAATATAG CAACAGACAAAATTATTGTTAACATGGATGTTA 384 0 

1 1 1 1 1 1 M 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 i 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 i 1 1 1 II I 

TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 384 0 
CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 

M I Mill Ml II II II M MMM II I III I IMIIIMI 

CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 
GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 

M II II Ml M II II I III M II I III I IIMMMI 

GTGGATGTATGTTCAAACACCTTTTAGTATTGATAG CTTACATATGG CCAAAGGAATACA 3 960 
GTTTATAG CAAAACATGGGTATG CTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 

M MMI III MMI II I IIMII II I III I IMMIIII 

GTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 
AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 080 

M Ml M Ml M II II I MMM II IMII I MIMM 

AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 08 0 
TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 

I M Ml I III I II II MM M I! I I! IMIMM I 

TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 
ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200 

M MMI Ml IMIMIMIMI IMMMMI M II I Ml 

ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200 



Qy       4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260

Qy       4261 AAAATGCCACATTTCTGGTCTCTGGG 4286
              ||||||||||||||||||||||||||
Db       4261 AAAATGCCACATTTCTGGTCTCTGGG 4286



RESULT 3 

US-09-921-406C-13 

Sequence 13, Application US/09921406C 
Publication No. US20030152923A1 
GENERAL INFORMATION: 
APPLICANT: Yakhini, Zohar 
APPLICANT: Ben-Dor, Amir 
APPLICANT: Sampas, Nick 
APPLICANT: Dougherty, Edward 
APPLICANT: Trent, Jeff 
APPLICANT: Meltzer, Paul 
APPLICANT: Chen, Yidong 
APPLICANT: Weeraratna, Ashani 
APPLICANT: Jiang, Yuan 
APPLICANT: Bittner, Michael 
TITLE OF INVENTION: Classifying Cancers 
FILE REFERENCE: 10010313-1 

CURRENT APPLICATION NUMBER: US/09/921,406C
CURRENT FILING DATE: 2001-08-02
NUMBER OF SEQ ID NOS: 41
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 13
LENGTH: 4286
TYPE: DNA

ORGANISM: Homo sapiens 
US-09-921-406C-13 

Query Match 99.6%; Score 4284.4; DB 13; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy          1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy         61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy        121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy        181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db        181 AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240



Qy        241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy        301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy        361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

Qy        421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

Qy        481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

Qy        541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

Qy        601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

Qy        661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

Qy        721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

Qy        781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

Qy        841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

Qy        901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960

Qy        961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020

Qy       1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080



Qy 1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 



Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 



IIMI II II II M : I IIMI M Mill II I i II !M 

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140 



IMMIIMIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 



GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 

MMM , I II II I I IIMI M I Ml II M l II M 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 
AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 132 0 

IIIIIMIIIIIIIMIIIMIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIMI 

AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 132 0 
AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 13 80 

IIMI ; I II II I I Mill Ml II II I I MM' 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 13 80 
AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 

MMM ; I II IM I I MMM II I M II MM IIMM 

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 



I II M M II II I I IIMI M I M II I M MM! 

'GCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTj 
AGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTG 

I II MMI II M! I I II II M IMM II I I II ! 

AGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCj 
'CTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAj 

I II MM II II ! I M II M MMI M I L II I 



AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACT^ 1680 

I II I I IM II MM II II M MMM II M IIMI II 

AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACT^ 168 0 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 

I M I I Ml II I M II II M M Ml M I MMI I 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 

TTACGG CATGGAAAGAAAAT CAGTGGGAATTAAGAAAGCCTCGTCGTGAAAG CACTTAAT 18 00 

MMI M I IIMI I ; IIMM I M II II! I I II II 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 

TTTTTACAGTTAGCACTTCAACATAGCTC 1860 

I II i; i II M I I ^IMM I: ! MMM i I M II 

TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

I! MMM I II II I I II II M MMM I! I MMI II 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 198 0 

MMMM I Ml II MM II II Ml I MMI I M Ml II 



Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Ov 


9701 

£t 1 \J ± 


Db 


2701 


Qy 


2761 


Db 


2761 



1921 AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 198 0 
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 204 0 

III INI III IMIIII llllll MM lllllllllll II INI III III Mlllllll 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 
TATCACACTATTAT CAGATTGTAATTAGATGCAAATGAGAGAG CAGTTTAGTTGTTGCAT 2100 

IIIIIIMIIIIIMMIIIMIIIIIIIIMIIIIIIIIIIIMIIIIIIIIMIMII 

TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100 
TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 

Ml II! I M Ml I I II I II II I Mill I MMI I 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAG CAAGG CTGT 2160 
TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 222 0 

II IM I M Ml I M II I II II I Mill I Mill 

TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 222 0 
(^CATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAA 228 0 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIII 

CAACATGTCACAAACAAGCAG CATGTAACAGACTGGCACATGTG C CAGCTGAATTTAAAA 2280 
TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 

I II II I II M I III llllll Mill Ml II M II IM I 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 
CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

MIIIIMM lllllllllilllllllllllllllllllllllllllllllllllll II 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 24 60 

M MMI M Ml! I I II I MMI IMIIMM I MMI M 

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460 
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIMIMI 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 
CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580 

IIIIIIIIIIIIMIIIIIIIIIIMIIMMIIIIIIIIIIIIIIIIIIIIIIMIIII 

CTG CATGTAGATGATTAAATGAGGGCAGGCCCTGTG CTCATAG CTTTACGATGGAGAGAT 258 0 
GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 264 0 

MIIIMMIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIMIIIIIIIIIM 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 264 0 
GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

MM I M II I IMIMM! II IM I Ml II M II II M 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 
GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 

IIMIIIMIMIIIIIMIIMIIIIIIIIIMIIIIIIIIIIIIIMIIIMIIIIM 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 
TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 282 0 

M M I II IMM M II I IM I M IM M I MMM 



Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 



ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 

I : : 1 1 II I III II 1 1 1 : 1 ! II i 1 1 I I ; 1 1 : 1 1 I I III 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 

I IMIMM M Ml II M M II M MjI II M MM IM I 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940 
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3 000 

MMI M I IMIMM I Mill I I II II M I II 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 



Ml I I M II II I M M IM I II II II M I II 



CX3TCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCr(^ 3120 

I M M M II IIIIIMIIM MMM I I II II MMI Ml 

CGTC^CATCAATGCAAAAGGTCCTGATTTTC 312 0 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 318 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIMI 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 318 0 
ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 

I II I M II IM ll MMI II I I I II M MiMI I 

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 
TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300 

IMMIIIIIIMIIIIIIIMIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIII 

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 33 00 
GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 

IIIIMIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 

GAAAGAAAGAGCAATAATAATTAATT(^C^CACCATATGGATTCTATTTATAAATCACCC 3360 
ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 342 0 

I MMI Ml MMI MM, MMM I MIMI III I I MMM 

ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 342 0 



IIIIMIIIIIIIIIMIIIIIIIIIIIIIMMIIIIIIMIIIIIIIMIIIIM 



TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 

Mill I I MIMMM I Ml II I M Mill MMIIIIIMI 

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 



IMI I I MIMI II I Ml IM I M MIMIMIMI MMM 

'CCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATT' 
'GAAACTACAC^CAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCT' 

M' M MMI Mill I I Ml II I M MMM IIMMMI II 



Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 


Db 


4021 


Qy 


4081 


Db 


4081 


Qy 


4141 


Db 


4141 


Ov 


4201 


Db 


4201 


Qy 


4261 


Db 


4261 



TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 372 0 

! : ■ ! I ■ : ! : : 1 1 ■ ! ; : 1 1 . ; 1 1 : 1 : ! I : . 

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 372 0 
TTCTTTACATACTC^AAACCAAGATAGAAAAAGGTGCTATCGTTC^CTTCAAAACATGT 3780 

i : I ! . . i ! : , ' ! I ■ 1 1 ! 1 I i I I 

TTCTTTACA.TACTC^y^CC^GATAGAAAAAGGTGCTATCGTTCAACTTCAAAAC^VTGT 3780 
TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 384 0 

M Ml I ll M M M M n immimm IMM Ml M M 

TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 384 0 



M I IM I IMM II II II II II II I II I II I M 



GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960 

M I II MIMM MUM II M II II I Ml I II MM 

GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 
GTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIMIIIIII 

GTTTATAGCAAAACZATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 
AAAAAATTATATATCTGGGAGGATTTTTTGGTTG CCTAAAGTGG CTATAGTTACTGATTT 4 080 

Ml III M II M II I II II MIMI Ml IMM III M Ml 

AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 08 0 



MM IM M II I M I II I MIMI Ml II I III I I 



ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200 

M I IM I II I IM I: II M IMM M MM M II MM 

ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 
CAAGTGGACATTATTTATGTTAAATATACA^ 42 60 

I Ml II II MIMM II I MM II II II; I II M: III I I 

CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260 



IIMIMIIIIIMIIMMIIIII 



RESULT 4 

US-10-210-120-15 

; Sequence 15, Application US/10210120
; Publication No. US20030175736A1
; GENERAL INFORMATION:
; APPLICANT: Chinnaiyan, Arul M.
; APPLICANT: Rubin, Mark A.
; APPLICANT: Sreekumar, Arun
; TITLE OF INVENTION: Expression Profile of Prostate Cancer
; FILE REFERENCE: UM-07221
; CURRENT APPLICATION NUMBER: US/10/210,120
; CURRENT FILING DATE: 2002-08-01
; PRIOR APPLICATION NUMBER: US 60/309,581
; PRIOR FILING DATE: 2001-08-02
; PRIOR APPLICATION NUMBER: US 60/334,468
; PRIOR FILING DATE: 2001-11-15
; NUMBER OF SEQ ID NOS: 123
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 15
; LENGTH: 4286
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-210-120-15 



Query Match 99.6%; Score 4284.4; DB 13; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy 


i 


GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 


60 






1 M II MM M I! 1 II MMI 1! M II IMM II 1 




UJJ 


± 


o>ioAUA 1 1 LLuo lubbbuHL itl ooL.v_AoUUUoAov_^\AL.o 1 ooA 1 LL1 0A0A0LAL 1 Lv_L 


D 0 


Qy 


61 


AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 


120 






11 IMIM II I MM M II III III Mill III II IMMM 




Db 


O J- 




ion 
IzU 


Qy 


121 


AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 


180 






1 II II II 1 MM 1 II Ml IM IMM IM II IMMM 




Db 


121 


AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 


180 


Qy 


181 


AACTTGG CTCTGAAACTGCGGAG CGG CCACCGGACGCCTTCTGGAG CAGGTAG CAG CATG 


240 






IMIMIIIIIIIIIIIIII MMMMMIMMMMMMMMMIMMIMM 




Db 


1 ft 1 

X O X 


i-u-i^ i 1 bbL 1L1 oi-u-LH.^ 1 bLoLHoLbbLLHLLbbHLbLL 1 1L1 ooA.vjUA.vjo 1 AoLAoUA 1 o 


z4 I) 


Qy 


241 


CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 


300 






1 Ml M III II M IMMM IMM II 1! II Mill IM 1 




Db 


241 


CAGCCGrrTrpAAnTrTnTnpnnAPnrnrrrTfjriTTr:rp,r i Tnp,TTPTTrippTnpnr , pnTP 

\ — nuv-VwUv^v- ± \w.v_~ririo O J- vjL.vJUrlv^uLvjLLV, 1 OO 1 IbLbL 1 Oo 1 1 uLL X uLuuLL 1 O 


300


Qy 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360


Qy 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420


Qy 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480


Qy 


481 


CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA


540 


Db 


481 


MIIMIIIMIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIMIIIMIIMMIII 

CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATC^ 


540 


Qy 


541 


TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA


600 






MMMMMMMMIMMMMMMMIMMMMMMIMIMIMMMIM 




Db 


541 


TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCT^ 


600 


Qy 


601 


CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC


660 



Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 



IIIIIIIIIIIIIIIIIIMIIMIIIMIIIIIMIIIIIIIIIIIIIIMIIIIIIII 

CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 

AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720 

IMIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIMMIIIIIIIIIIIIIIMIIIII 

AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720 

CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900



IIIIIIIMIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIMIIIIIMIIIIIIIII 



ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020 

IIMIIMIIIIMIIIIMIIIIIIIMIIIIIIIIMIIIIIIIMIIIIIIIIIIII 

ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020
AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080
TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140
AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200 

IIIIIIMIMIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIMIIIIIMIIII 

AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200
GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260

AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320



MIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIMIIIIII 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

IMIIIMIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 



Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy




Db 


2221 


Qy 


2281 


Db 


2281 



1441 TG CTG CTGGTG CCAGTCATTTGAAGAAAAAG^GTCCTTGGAGGAAAAG CAGTCGTG CTTA 1500 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 

M 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 Ml li; II 1 1 1 1 IN I 1 1 1 1 1 1 ! 1 1 1 1 I II 1 1 1 1 1 1 1 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 



MMIIIIIIMMMIMIIIIIIIIIIMMIIIIIIMIIIIIIIIIIIMIII 



AACAAAATGAAACATTTGCCAAAACAAAA 1680 

MMI M MMMI I II II Ml II II EM M IM II Ml 

AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACACTAT 1680 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800

TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860

IIIIMIIMIIIMIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIII 

TTTTTACAGTTAGCACTTCAACATAGCTCTTAACA^ 1860 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920 

IIIIIIIMIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIMIIIIIIIIII 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 

MINI IIIIIIIIIIMIMIIIIIIIIII IIIIIIIIIMI MIIIIIIIIMI 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040

TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160

IIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIMIIIIIIIIIIIMIII 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 

TTTTGAAAATCATTAC^CTTTCACTAGAAGCCC^ 2220 

lIlllllMIMllMlllllllllllMlllllllllllllllllllllllllllllll 

TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220

CAACATGTCACAAACAAGCAGCATGT 22 80 

IIIMIIIIIMMIMIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIMII 

CAACATGTCACAAACAAGCAGCATGTAACAGACT^ 22 8 0 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 2340

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 2340



Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 



CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 2400

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 2400
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520 

MIMIIIIIIIIMMIIIIIIIIMMIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520 
CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580
GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640
GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

M II Mill IM IM M M II MIMM MIMMIM IMM 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 
GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760
TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820
ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

IMIIIIIIIIIIIIIIMIIIIIIIMIMIIIIIIIIIIIIIIIIIIIMMIIIIM 



GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060
CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 3120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 3120
GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180



Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 



ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240



lllllllllll Ml IIIEIIIIIIIIIIIIIIIIIMI IIIIIMIIIIIIIIIMI 



GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360
ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420
CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480
TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540
TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600
TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660
TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720
TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780
TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3840

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3840
CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900
GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960
GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020
4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080



Db 


4021 


Qy 


4081 


Db 


4081 


Qy 


4141 


Db 


4141 


Qy 


4201 


Db 


4201 


Qy 


4261 


Db 


4261 



IIMIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT
TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 

IIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 
ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 

II III I II I IIMI II III hill I III Mill III ! 

ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 
CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 

Ml II M Ml ll Mi MMI III I MMI II IMM MM 

CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 
AAAATGCCACATTTCTGGTCTCTGGG 4286 

IMIIIMIIIIMIIIIIIIIIIII 



RESULT 5 

US-10-225-567A-113 

; Sequence 113, Application US/10225567A
; Publication No. US20030113798A1
; GENERAL INFORMATION:
; APPLICANT: Lifespan Biosciences
; APPLICANT: Brown, Joseph P.
; APPLICANT: Burmer, Glenna C.
; APPLICANT: Roush, Christine L.
; TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN-COUPLED
; RECEPTORS (GPCRS)
; FILE REFERENCE: 1920-4-4
; CURRENT APPLICATION NUMBER: US/10/225,567A
; CURRENT FILING DATE: 2001-12-19
; PRIOR APPLICATION NUMBER: 60/257,144
; PRIOR FILING DATE: 2000-12-19
; NUMBER OF SEQ ID NOS: 2292
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 113
; LENGTH: 4286
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-225-567A-113 

Query Match 99.6%; Score 4284.4; DB 15; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy  1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy 61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120



Qy 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180



Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961 



IMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIMIIIIIIMMIIIIIIIIIII 

AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Ml lllllllllll IN I MUM lllllllllll MMMMIMIMM 

AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300 

IIMMIMIMIIIIIIIIIIMMIIIIMIIIIMIIMIMIIIIIIIIIIIIIM 

CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300 
TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360 

1 1 1 II 1 1! I II 1 1 1 1 II I II II 1 1 1 MM! 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 Ml INI 

TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360 
ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420 

1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420
CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480
CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 

CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 

1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 M I II I 

CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 



MM MMMMMMMMMMIMM MMMMMMIIMM III 



CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780
AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840
GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900
ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 

: I : M iM, . I II M M : I I : 

ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 
ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020

IIIIIIIIIIIIIIIIIMIIMIIIIIIMIMIIIIIIIIIIIIIIIIIIMIIIIII 



Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy


1741 


Db 


1741 


Qy 


1801 


Db 


1801 



961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020 
AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080
TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140
AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200 

MMI MMI IMI IMI III II Ml IMM Ml MMIMI 

AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200 
GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 

I M I MI M MUM MM IM I II II Ml Mill III I 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 
AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320
AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380
AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440
TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIMMIIIIIIIIIM 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 
AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560
TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620
AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGC^ 1680 

M MIMM II II I II I II II I MM II M 

AACAAAATGAAACATTTGCCAAAACAAAACAA 1680 
TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740
TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800
TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860



Qy 1861 TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1861 TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920


Qy 1921 AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1921 AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980


Qy 1981 AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1981 AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040


Qy 2041 TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2041 TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100


Qy 2101 TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2101 TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160


Qy 


2161 


TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCT 


2220 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2161 


TTTTGAAAATCATTACACTTTCACTAGAAG CC CAAACCTCAGCATTCTGCAATATRTAAf 1 


£i £t £» VJ 


Qy 


2221 


CAACATGTCACAAACAAGCAGC^ 


2280 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2221 


CAACATGT CACAAACAAGCAG CATGTAACAGACTGG CACATGTGCCAGCTGAATTTAAAA 


228 0 


Qy 2281 TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 2340

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2281 TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 2340


Qy 


2341 


CAAAGAGAAATAGAATGTTTGAAAGG CTATC CCAAAAGACTTTTTTGAAT CTGTCATT CA 


2400 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 
1 M 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 




Db 


2341 


CAAAGAGAAATAGAATGTTTGAAAGG CTATC C CAAAAGACTTTTTTGAATrTGTr A TT PA 




Qy 


2401 


CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 


2460 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2401 


CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATrTTrTTTTT 




Qy 


2461 


TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 


2520 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I I I || I I || I I I I | || | | | | | | | 




Db 


2461 


TCACTATCGTAGCTTAAACTCTGTTTGGTTTTnTCATCTGTAAATA CTTAPPTAPATAPA 




Qy 


2521 


CTG CATGTAGATGATTAAATGAGGG CAGGCCCTGTG CTCATAGCTTTACGATGGAGAGAT 


2580 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 II 1 1 1 1 II 
1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 M 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 I 




Db 


2521 


CTG CATGTAGATGATTAAATGAGGG CAGGCCCTGTG CTCATAG CTTTACGATGHAGAG A T 




Qy 2581 GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2581 GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640


Qy 2641 GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2641 GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700



Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy


3421 


Db 


3421 


Qy 


3481 


Db 


3481 



GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 

IIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIMMIIIIIIIMMII 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760

TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820
ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

I MUM IMM IMM MM IMM IM IMIMM II MMI! 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 

. i I' I I : M M; ; M I I M, I LM :li 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 

CGTCACATCAATGCAAAAGGTCCTGATTTTG 312 0 

II M II MM I III M IMMI IM II II III I Ml MM 

CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 3120

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 

IIMIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIII 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300

GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360

ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420



IIIIIIIIIIIMIIIIMIIMIIIIIIMIIIIIIIIIIIIIMIIIIIIIIMIMI 



TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540



Qy 



3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 



Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 


Db 


4021 


Qy 


4081 


Db 


4081 


Qy 


4141 


Db 


4141 


Qy




Db 


4201 


Qy 


4261 


Db 


4261 


RESULT 6

US-10-020-141-5



I :M II II III II M Ml II III III IMMiM M MM 

TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 
TGAAACTACACACAAAAAGCATACTTGCAT 3660 

1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 E i 1 1 1 [[ 1 1 1 !! j 1 1 1 1 1 1 1 E 1 1 1 1 ! 

TGAAACTACACAGAAAAAGCATACTTGCATT^ 3660 
TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720
TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780
TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3840

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3840
CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900
GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960



MUM Mill III III MIMM MM III MIMIIMM II II IM III MIMIII 



AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 080 

MMMM II II MM I III M MIMM II IIMI II II II 

AAAAAATTATATAT CTGGGAGGATTTTTTGGTTGCCTAAAGTGG CTATAGTTACTGATTT 4 080 
TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 

M Mill II IMMI I III M MIMM IIMI II MMMI II 

TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 
ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 

IIIIIIIMIIIIIIIMIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIMI 

ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200 
CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260 

I I Ml II I MM I MM M li II! M IM I II IIMI 

CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260 



M MMM M M ! 



Sequence 5, Application US/10020141 
Publication No. US20030092013A1 
GENERAL INFORMATION: 



; APPLICANT: McCarthy, Jeanette 
; APPLICANT: Ableson, Allen 

; TITLE OF INVENTION: DIAGNOSIS AND TREATMENT OF VASCULAR DISEASE 
; FILE REFERENCE: MMI-002 

; CURRENT APPLICATION NUMBER: US/10/020,141 

; CURRENT FILING DATE: 2001-12-14 

; PRIOR APPLICATION NUMBER: US 60/313,097 

; PRIOR FILING DATE: 2001-08-16 

; PRIOR APPLICATION NUMBER: US 60/327,485 

; PRIOR FILING DATE: 2001-10-05 

; NUMBER OF SEQ ID NOS: 21 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 5 

LENGTH: 183337 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-020-141-5 

Query Match 66.1%; Score 2841.8; DB 15; Length 183337; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2854; Conservative 0; Mismatches 2; Indels 1; Gaps 1; 



Qy 1430 AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 14 89 

i i ii m ii ii iimi ii h ii i mum i sun u 

Db 7283 0 AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 72771 

Qy 14 90 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 154 9 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 72770 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 72 711 

Qy 155 0 AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 1609 

I I I I ! I ! I I I I ' : . I I I : i ^ 

Db 72710 AATACAG CT CATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 72651 

Qy 1610 GAAGTCATTAAAACAAAATGAAACATTTGCCAAAAC 1669 

IIIIMIIIMMMIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIII 

Db 7265 0 GAAGTCATTAAAACAAAATGAAACATTTGCCAA 72591 

Qy 167 0 CAGCACACTATTAAAATATTAAGTGTAATTATTTTAACACTCACA^ 172 9 

I MIMI I MM I Ml I M I MMM II IMMI II M I 

Db 725 90 CAGCAC^CTATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATT 72 531 

Qy 173 0 TTATGAGCTGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGA 178 9 

MMIIIIMIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIMIIIIIIMIII 

Db 7253 0 TTATGAG CTGTTTACGG CATGGAAAGAAAATCAGTGGGAATTAAGAAAGC CTCGT CGTGA 72471 

Qy 17 90 AAGCACTTAATTTTTTACAGTTAGC^CTTC^CATAGCTCTTAACAACTTCCAGGATATT 184 9 

M 11/11 II I llllll IIMI MM I; M llllll II III M 

Db 7247 0 AAGCACTTAATTTTTTACAGTTAGC^CTTCAACATAGCTCTTAACAACTTCCAGGATATT 72411 

Qy 1850 CACACAACA.CTTAGGCTTAAAAATGAGCTC^CTCAGAATTTCTATTCTTTCTAAAAAGAG 1909 

MIIIIIIIMIIIIIIIIMIMIIIIIIIIIMMIIMIIIIIIIIIIIIIMIIM 

Db 72410 CACAO^CACTTAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAG 72351 

Qy 1910 ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 1969 

IIIIMIIIIIIMIMIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIII 

Db 72350 ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 722 91 



Qy 1970 ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 202 9 

II MM I III II M IMMMMIMMl Ml II MIMI III I 

Db 722 90 ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAA.CTTT 72231 

Qy 2 030 TCAATTAATATTATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTT 2089 

IIIIMIIIIMIIMIIIIMIIMIIIIMIIIIIIIIIIIMIIIIIMIIIIMII 

Db 7223 0 TCAATTAATATTATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTT 72171 

Qy 2 090 AGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAG 214 9 

I M I M IM M III II I IM II IM MMI Ml M 

Db 7217 0 AGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAG 72111 

Qy 215 0 AGCAAGGCTGTTTTTGAAAATC^TTACACTTTC^CTAGAAGCCCAAACCTCAGCATTCTG 22 09 

IIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIM 

Db 72110 AGCAAGGCTGTTTTTGAAAATC^TTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTG 72 051 

Qy 2210 CAATATGTAACCAACATGTCACAAA 2269 

MIIIIMIIIIIIIIIIIMIIIIIIIIIIIIMIIIIMIMIIIIIMIIIIIMM 

Db 72 050 CAATATGTAACCAACATGTCACAA^ 71991 

Qy 2270 TGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAG 2329 

M illill llh II II ill ilhlMII llhlll :l III II I 

Db 71990 TGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAG 71931 

Qy 2330 ATCAAAC CTCACAAAGAGAAATAGAATGTTTGAAAGG CTATCCCAAAAGACTTTTTTGAA 2389 

IIIIIIIIMMIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIMI 

Db 71930 ATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGG CTATCCCAAAAGACTTTTTTGAA 71871 

Qy 23 90 TCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAA 244 9 

M I III II II il Ml MMMMII MINI II II ill M 

Db 71870 TCTGTCATTCACATACCCTGTGAAGACAATACTAT 71811 

Qy 24 50 ATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTT 2509 

III I II II II II Ml M III MMMI M MM II M 

Db 71810 ATCTTCTTCTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTT 71751 

Qy 2510 ACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTAC 2569 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIMIIIII 

Db 71750 ACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTAC 71691 

Qy 2570 GATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCAC 262 9 

MIMIIIMIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIII 

Db 71690 GATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCAC 71631 

Qy 263 0 ATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGC 268 9 

IIMMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIII 
Db 71630 ATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGC 71571 

Qy 2690 ATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTA 274 9 

I Ml M 1 1 1 1 1 III. II 1 1 I I ill I 1 1 1 1 1 

Db 71570 ATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTA 71511 

Qy 2750 ACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAA 2809 

II IMM II II M Ml II I II II IM II II Ml MM 

Db 71510 ACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAA 714 51 



Qy 2810 AAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGT 2869 

llllllllllllill llllli lllllllllll MUM llllllllllllllll 
Db 714 50 AAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGT 71391 

Qy 2870 AAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCT 2 92 9 

MIIIMIIMIIIMIIIIIIIIIIIIMIIIMIIIIIIIMIIIIIIIIIIMIIII 

Db 71390 AAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCT 71331 

Qy 2 93 0 TAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAG 2 98 9 

: : I: I J I I' I ; I Ml M 

Db 7133 0 TAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAG 71271 

Qy 2 990 GAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTT 304 9 

i I M 1 1 1 1 M II 1 1 1 1 : M 1 1 !| 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1! I MM 1 1 1 1 1 1 1 

Db 7127 0 GAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTT 71211 

Qy 3 050 CGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTG 3109 

IIIMIIIIMIIIIIMIIIMIIMIIIIIIIIEIIIIIIIIIIIiMMIIIII III 

Db 71210 CGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTG 71151 

Qy 3110 CAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAA 3169 

IIIIIIMIIIIIIIIIIIMMMIIIIIMIMIIIIIMIIMIIMMIIIIIMI 

Db 7115 0 CAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAA 71091 

Qy 3170 AATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATA 3229 

II I II I M I Ml 1 1 M I II I Ml I II 1 1 1 MM I 1 1 1 1 M II M II II I II II M I M 

Db 71090 AATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATA 71031 

Qy 3230 AGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAA 32 8 9 

M ill I III II II II MIMI M II II III I II Ml M 

Db 71030 AGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAA 70971 

Qy 32 90 CAATGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTT 334 9 

II II II II II I 

Db 70970 CAATGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTT 70911 

Qy 335 0 ATAAATCACCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTT 34 09 

I I II ll MM MM II IL h II .1 IM I II I II 

Db 70910 ATAAATCACCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTT 70851 

Qy 3410 ATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCA 3469 

M II II M II IMIIMM II IM MMMM Ml llllli I III 

Db 70850 ATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCA 70791 

Qy 3470 CAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACT 352 9 

Ml MM I II I II I MIMI M MM I Ml I II I I 

Db 70790 CAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACT 70731 

Qy 3 53 0 GAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATC 3 589 

! i i M I MIMMIIIIIIIIIMIIIMIMIIIIIIIIIIII 

Db 7073 0 GAATTTTTACATCCTGATACCTTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATC 70671 

Qy 3590 TTG C CAAATTTTGAAACTA CACACAAAAAG CATACTTG CATTATTTATAATAAAATTG CA 364 9 

IIIIIIIIMIIIIMIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIII 

Db 70670 TTG CCAAATTTTGAAACTACACACAAAAAG CATACTTG CATTATTTATAATAAAATTG 70611 

Qy 365 0 TTCAGTGGCTTTTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAA 3709 



I 1 IIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 70610 TTCAGTGGCTTTTT-AAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAA 70552 

Qy 3710 ACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACT 3769 

IMIIMIIMIIIII III IIIIIIIIIIIIIIMIIIII III IMIIIIIIMMIIII 

Db 70551 ACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACT 704 92 

Qy 3770 TCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTG 382 9 

I !h II IMM II Ml II MIM '11111 il ,11, MMI MM 

Db 70491 TCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAA 70432 

Qy 383 0 CATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTAT 3889 

M I IM I II I II IM II I II II MMIIMM M I I 

Db 70431 CATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTAT 70372 

Qy 3890 CCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCC 394 9 

I II M II M IM I II I II III M II I I! I M 

Db 70371 CCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCC 70312 

Qy 395 0 AAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAA 4 009 

IMIIIIIIMIIIIIMMIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIMIIMM 

Db 70311 AAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAA 70252 

Qy 4 010 TATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTG CCTAAAGTGG CTATA 4 069 

I IM I II! I IMM' II M II IM II II M II I I 

Db 70251 TATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTG CCTAAAGTGG CTATA 70192 

Qy 4 070 GTTACTGATTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTA 412 9 

mmmii iimimiiiii i mm mimi i imiiii mmmmmi 

Db 70191 GTTACTGATTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTA 70132 

Qy 413 0 CCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAA 4189 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIMIIIII 

Db 70131 CCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAA 70072 

Qy 4190 TATAAATGTGACAAGTGGAC^TTATTTATGTTAAATATACAATTATCAAGCAAGTATGAA 424 9 

II IMM IMM Ml 'I II M II 'I Ml IM I M M, I 

Db 70071 TATAAATGTGACAAGTGGACATTATTTATGTTAAATATAC^^ 70012 

Qy 425 0 GTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 4286 

II IM I IM I M I II I I 

Db 70011 GTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 69975 



RESULT 7 

US-09-826-509-496 

; Sequence 496, Application US/09826509 

; Publication No. US20030204073A1 

; GENERAL INFORMATION: 

; APPLICANT: Lehmann-Bruinsma, Karin 

; APPLICANT: Liaw, Chen W. 

; APPLICANT: Lin, I-Lin 

; TITLE OF INVENTION: Non-Endogenous, Constitutively Activated Known G 
; TITLE OF INVENTION: Protein-Coupled Receptors 
; FILE REFERENCE: AREN-207 

; CURRENT APPLICATION NUMBER: US/09/826,509 



CURRENT FILING DATE: 2001-04-05 
PRIOR APPLICATION NUMBER: 60/195,747 
PRIOR FILING DATE: 2000-04-07 
PRIOR APPLICATION NUMBER: 09/170,496 
PRIOR FILING DATE: 1998-10-13 
NUMBER OF SEQ ID NOS: 589 
SOFTWARE: PatentIn Version 2.1 
SEQ ID NO 496 
LENGTH: 1329 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-826-509-496 

Query Match 30.8%; Score 1322.6; DB 13; Length 1329; 

Best Local Similarity 99.7%; Pred. No. 1.1e-257; 

Matches 1325; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 
Qy 23 8 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 2 97 

! I I ! ii :! I ; : I; I ;l II ll i 

Db 1 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 60 

Qy 2 98 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 357 

MIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIM lllllllllllllllllllll 

Db 61 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCCGACAGGGCCACTCCGCTTTTG 12 0 

Qy 358 CAAACCGC^GAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 417 

IIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIII 

Db 121 O^CCGCAGAGATAATGACGCC^CCC^CTAAGACCTTATGGCCa^AGGGTTCCAACGCC 180 

Qy 418 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 477 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 AGTCTGG CG CGGTCGTTGG CACCTGCGGAGGTGC CTAAAGGAGACAGGACGGCAGGATCT 24 0 

Qy 4 78 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 537 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

Db 241 CCGCCACGCACC^TCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 300 

Qy 538 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 597 

I MI'll II IMilM II III IMIIL I ill Ml IM || 

Db 3 01 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 360 

Qy 598 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 657 

II Ml MM II II III MM IIMII Ml I II IM II II 

Db 361 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 42 0 

Qy 658 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTA^ 717 

I M II Ml I II, II Ml IM IM I II II !MMI 

Db 421 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 480 

Qy 718 AAG CTGCTGG CAGAGGACTGG CCATTTGGAGCTGAGATGTGTAAG CTGGTG CCTTTCATA 777 

II Ml M MM I II IM IM II II IM MMI Ml Ml 

Db 481 AAG CTGCTGGCAGAGGACTGG C CATTTGGAGCTGAGATGTGTAAGCTGGTGC CTTTCATA 54 0 

Qy 778 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 837 

M MMMIMIIMM II Ml, IIMII Ml M i I III II II 

Db 541 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 600 



Qy 


838 


Db 


601 


Qy 


898 


Db 


661 


Qy 


958 


Db 


721 


Qy 


1018 


Db 


781 


Qy 


1078 


Db 


841 


Qy 


1138 


Db 


901 


Qy 


1198 


Db 


961 


Qy 


1258 


Db 


1021 


Qy 


1318 


Db 


1081 


Qy 


1378 


Db 


1141 


Qy 


1438 


Db 


1201 


Qy 


1498 


Db 


1261 


Qy 


1558 


Db 


1321 



CGAG CTGTTG CTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAG CAGTA 897 

IMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIII 

CGAG CTGTTG CTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAG CAGTA 660 
GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 957 

I! MMI II II IIMMMII MM II IM III M III I Ml 

GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 720 
GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1017 

M II M II II Ml II I Ml II M II: M III I II 

GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 780 
CAGAAGAC^GCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTC 1077 

I II 11 II I MM I Ml I Ml INM II M II I I 

CAGAAGACAGCTTTCATGCAGTTTTAC^AGACAGCAAAAGATTGGTGGCTGTTCAGTTTC 84 0 

ItlllmmTim^ 1137 

TATTTCTGCTTGCC^TTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 900 
TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1197 

II MM I Ml II II I llllll Mill M IMIMMM Mill 

TTGAGAAAGAAAAGTGGCA.TGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 960 
GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1257 

Ml Mllllllll lllllllllllllllll IIIIIIMMIIMMMMMIMM 

GTGAAGAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 102 0 



II M II I Ml M. Ml I II IM M III I II I I 



TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1377 

M Ml MUM I III I Ml MM| |M M MMMMI M I 

TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 114 0 

ATTAACCCAATTG CT CTGTATTTGGTGAG CAAAAGATTCAAAAACTG CTTTAAGTCATG C 1437 

M MMI II Mi II I MUM Mill II Ml llllll MM 

ATTAACCCAATTG CT CTGTATTTGGTGAG CAAAAGATTCAAAAACTG CTTTAAGTCATG C 12 00 

TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 14 97 

MM II M II I Ml MIMI I II II I MM M III I Ml 

TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 12 60 



M IMM MM I II I M MMI II, M IM I II M II 

TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 
TCATCTTGA 1566 



RESULT 8 

US-09-778-927A-27 

; Sequence 27, Application US/09778927A 
; Patent No. US20020068342A1 



GENERAL INFORMATION: 
APPLICANT: KHOSRAVI, Rami et al. 

TITLE OF INVENTION: NOVEL NUCLEIC ACID AND AMINO ACID SEQUENCES AND NOVEL 
TITLE OF INVENTION: VARIANTS OF ALTERNATIVE SPLICING 
FILE REFERENCE: 2786-0160P 

CURRENT APPLICATION NUMBER: US/09/778,927A 
CURRENT FILING DATE: 2001-02-08 
PRIOR APPLICATION NUMBER: IL 134453 
PRIOR FILING DATE: 2000-02-09 
PRIOR APPLICATION NUMBER: IL135341 
PRIOR FILING DATE: 2000-03-29 
NUMBER OF SEQ ID NOS: 81 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 27 
LENGTH: 800 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: misc_feature 
LOCATION: (1)..(800) 

OTHER INFORMATION: n = a,c,g,t any unknown or other 
US-09-778-927A-27 

Query Match 17.7%; Score 763.2; DB 9; Length 800; 

Best Local Similarity 98.3%; Pred. No. 1.9e-144; 

Matches 771; Conservative 0; Mismatches 13; Indels 0; Gaps 0; 

Qy   1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy  61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy 181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Qy 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

Qy 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

Qy 481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

Qy 541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

Qy 601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

Qy 661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

Qy 721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780
       ||||||||||||||||||||||||||||||||||||||  || | |   | ||||  ||
Db 721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGCCAGGTAGGAGCGTTCACCCAC 780

Qy 781
       ||
Db 781



RESULT 9 

US-09-962-436-531/C 

; Sequence 531, Application US/09962436 

; Patent No. US20020081301A1 

; GENERAL INFORMATION: 

; APPLICANT: Soppet, Daniel 

; TITLE OF INVENTION: Cancer Gene Determination and Therapeutic Screening Using Signature Gene 
; TITLE OF INVENTION: Sets 

; FILE REFERENCE: 689290-75 

; CURRENT APPLICATION NUMBER: US/09/962,436 

CURRENT FILING DATE: 2001-09-25 

PRIOR APPLICATION NUMBER: US/60/235,082 

PRIOR FILING DATE: 2000-09-25 
; PRIOR APPLICATION NUMBER: US/60/234,924 
; PRIOR FILING DATE: 2000-09-25 
; NUMBER OF SEQ ID NOS: 568 

SOFTWARE: PatentIn version 3.0 
; SEQ ID NO 531 
LENGTH: 592 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: misc_feature 
OTHER INFORMATION: n=a,t,g or c 
US-09-962-436-531 

Query Match 10.2%; Score 440.6; DB 9; Length 592; 

Best Local Similarity 90.1%; Pred. No. 3.6e-79; 

Matches 525; Conservative 0; Mismatches 47; Indels 11; Gaps 5; 



Qy  993 GCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGTTTTACAAGACAGC 1052
        | | | |||  ||  |||||| |||||||||||||||| ||  |||  || | || | |
Db  578 GGGGAACTGGCTGGCTCATCCNGTTCAGAAGACAGCTTNCAGCCAG--TTNCCAGGCCGG 521

Qy 1053 AAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCACTGCATTTTT 1112
         |||   |||||||||| ||||    | ||| |  |     |||||||||| ||| ||||
Db  520 CAAAAGATGGTGGCTGTCCAGTNC--AATTCCGGCTTTNCATGGCCATCACNGCA-TTTT 464

Qy 1113 TTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAA 1172
        ||||||||||| |||||||||||||||||||| |||||||||||||||||||||||||||
Db  463 TTATACACTAANGACCTGTGAAATGTTGAGAAGGAAAAGTGGCATGCAGATTGCTTTAAA 404

Qy 1173 TG---ATCACCTAAAGCAGAGACGGGAAGTGGCCAAAA---CCGTCTTTTGCCTGGTCCT 1226
        ||     | |  ||  ||||| ||| |||||||| |||   | |||||||||||||||||
Db  403 TGGACCACCCTAAAGCCAGAGCCGGNAAGTGGCCCAAAANCCGGTCTTTTGCCTGGTCCT 344

Qy 1227 TGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTA 1286
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  343 GGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTA 284

Qy 1287 TAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATAT 1346
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  283 TAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATAT 224

Qy 1347 TGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGGTGAG 1406
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  223 TGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGGTGAG 164

Qy 1407 CAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGA 1466
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  163 CAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGA 104

Qy 1467 AAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATA 1526
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  103 AAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATA 44

Qy 1527 TGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAG 1569
        |||||||||||||||||||||||||||||||||||||||||||
Db   43 TGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAG 1



RESULT 10 

US-09-880-107-2060/C 

; Sequence 2060, Application US/09880107 

; Patent No. US20020142981A1 

; GENERAL INFORMATION: 

; APPLICANT: Horne, Darci T. 

; APPLICANT: Vockley, Joseph G. 

; APPLICANT: Scherf, Uwe 

; APPLICANT: Gene Logic, Inc. 

; TITLE OF INVENTION: Gene Expression Profiles in Liver Cancer 

; FILE REFERENCE: 44921-5028-WO 

; CURRENT APPLICATION NUMBER: US/09/880,107 

; CURRENT FILING DATE: 2001-06-14 

PRIOR APPLICATION NUMBER: US 60/211,379 
; PRIOR FILING DATE: 2000-06-14 



PRIOR APPLICATION NUMBER: US 60/237,054 
PRIOR FILING DATE: 2000-10-02 
NUMBER OF SEQ ID NOS: 3950 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 2060 
LENGTH: 592 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 

OTHER INFORMATION: Genbank Accession No. US20020142981A1 H97587 
NAME/KEY: unsure 
LOCATION: (1)..(592) 

OTHER INFORMATION: n = a or c or g or t 
US-09-880-107-2060 

Query Match 10.2%; Score 440.6; DB 10; Length 592; 

Best Local Similarity 90.1%; Pred. No. 3.6e-79; 

Matches 525; Conservative 0; Mismatches 47; Indels 11; Gaps 5; 

Qy  993 GCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGTTTTACAAGACAGC 1052
        | | | |||  ||  |||||| |||||||||||||||| ||  |||  || | || | |
Db  578 GGGGAACTGGCTGGCTCATCCNGTTCAGAAGACAGCTTNCAGCCAG--TTNCCAGGCCGG 521

Qy 1053 AAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCACTGCATTTTT 1112
         |||   |||||||||| ||||    | ||| |  |     |||||||||| ||| ||||
Db  520 CAAAAGATGGTGGCTGTCCAGTNC--AATTCCGGCTTTNCATGGCCATCACNGCA-TTTT 464

Qy 1113 TTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAA 1172
        ||||||||||| |||||||||||||||||||| |||||||||||||||||||||||||||
Db  463 TTATACACTAANGACCTGTGAAATGTTGAGAAGGAAAAGTGGCATGCAGATTGCTTTAAA 404

Qy 1173 TG---ATCACCTAAAGCAGAGACGGGAAGTGGCCAAAA---CCGTCTTTTGCCTGGTCCT 1226
        ||     | |  ||  ||||| ||| |||||||| |||   | |||||||||||||||||
Db  403 TGGACCACCCTAAAGCCAGAGCCGGNAAGTGGCCCAAAANCCGGTCTTTTGCCTGGTCCT 344

Qy 1227 TGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTA 1286
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  343 GGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTA 284

Qy 1287 TAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATAT 1346
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  283 TAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATAT 224

Qy 1347 TGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGGTGAG 1406
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  223 TGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGGTGAG 164

Qy 1407 CAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGA 1466
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  163 CAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGA 104

Qy 1467 AAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATA 1526
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  103 AAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATA 44

Qy 1527 TGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAG 1569
        |||||||||||||||||||||||||||||||||||||||||||
Db   43 TGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAG 1



RESULT 11 

US-09-954-531-917/c 

; Sequence 917, Application US/09954531 

; Patent No. US20020165180A1 

; GENERAL INFORMATION: 

; APPLICANT: Weaver, Zoe 

; TITLE OF INVENTION: Process for Identifying Anti-Cancer Therapeutic Agents Using Cancer 
; TITLE OF INVENTION: Gene Sets 
; FILE REFERENCE: 689290-77 

; CURRENT APPLICATION NUMBER: US/09/954,531 

; CURRENT FILING DATE: 2002-05-02 

; PRIOR APPLICATION NUMBER: US/60/233,133 

; PRIOR FILING DATE: 2000-09-18 

; PRIOR APPLICATION NUMBER: US/60/234,009 

; PRIOR FILING DATE: 2000-09-20 

; PRIOR APPLICATION NUMBER: US/60/234,034 

PRIOR FILING DATE: 2000-09-20 

PRIOR APPLICATION NUMBER: US/60/234,509 

PRIOR FILING DATE: 2000-09-22 
; PRIOR APPLICATION NUMBER: US/60/234,567 
; PRIOR FILING DATE: 2000-09-22 
; NUMBER OF SEQ ID NOS: 1392 
; SOFTWARE: PatentIn version 3.0 
; SEQ ID NO 917 
LENGTH: 592 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: misc_feature 
LOCATION: (1)...(592) 
OTHER INFORMATION: n=a,t,g or c 
US-09-954-531-917 



Query Match 10.2%; Score 440.6; DB 10; Length 592; 

Best Local Similarity 90.1%; Pred. No. 3.6e-79; 

Matches 525; Conservative 0; Mismatches 47; Indels 11; Gaps 5; 



Qy  993 GCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGTTTTACAAGACAGC 1052
        | | | |||  ||  |||||| |||||||||||||||| ||  |||  || | || | |
Db  578 GGGGAACTGGCTGGCTCATCCNGTTCAGAAGACAGCTTNCAGCCAG--TTNCCAGGCCGG 521

Qy 1053 AAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCACTGCATTTTT 1112
         |||   |||||||||| ||||    | ||| |  |     |||||||||| ||| ||||
Db  520 CAAAAGATGGTGGCTGTCCAGTNC--AATTCCGGCTTTNCATGGCCATCACNGCA-TTTT 464

Qy 1113 TTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAA 1172
        ||||||||||| |||||||||||||||||||| |||||||||||||||||||||||||||
Db  463 TTATACACTAANGACCTGTGAAATGTTGAGAAGGAAAAGTGGCATGCAGATTGCTTTAAA 404

Qy 1173 TG---ATCACCTAAAGCAGAGACGGGAAGTGGCCAAAA---CCGTCTTTTGCCTGGTCCT 1226
        ||     | |  ||  ||||| ||| |||||||| |||   | |||||||||||||||||
Db  403 TGGACCACCCTAAAGCCAGAGCCGGNAAGTGGCCCAAAANCCGGTCTTTTGCCTGGTCCT 344

Qy 1227 TGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTA 1286
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  343 GGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTA 284

Qy 1287 TAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATAT 1346
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  283 TAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATAT 224

Qy 1347 TGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGGTGAG 1406
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  223 TGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGGTGAG 164

Qy 1407 CAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGA 1466
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  163 CAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGA 104

Qy 1467 AAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATA 1526
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  103 AAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATA 44

Qy 1527 TGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAG 1569
        |||||||||||||||||||||||||||||||||||||||||||
Db   43 TGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAG 1



RESULT 12 

US-09-918-995-7311 

; Sequence 7311, Application US/09918995 

; Publication No. US20030073623A1 

; GENERAL INFORMATION: 

; APPLICANT: Hyseq, Inc. 

; TITLE OF INVENTION: NOVEL NUCLEIC ACID SEQUENCES OBTAINED 
; TITLE OF INVENTION: FROM VARIOUS CDNA LIBRARIES 
; FILE REFERENCE: 20411-756 

; CURRENT APPLICATION NUMBER: US/09/918,995 

; CURRENT FILING DATE: 2001-07-30 

; PRIOR APPLICATION NUMBER: US/09/235,076 

; PRIOR FILING DATE: 1999-01-20 

; NUMBER OF SEQ ID NOS: 38054 

; SOFTWARE: FastSEQ for Windows Version 3.0 

; SEQ ID NO 7311 

LENGTH: 405 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-918-995-7311 



Query Match 9.4%; Score 403.4; DB 11; Length 405; 

Best Local Similarity 99.8%; Pred. No. 9.9e-72; 

Matches 404; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 2062 TAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCATTTTTCGGACACTGGAAACATT 2121
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 TAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCATTTTTCGGACACTGGAAACATT 60

Qy 2122 TAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGTTTTTGAAAATCATTACACTTT 2181
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 TAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGTTTTTGAAAATCATTACACTTT 120

Qy 2182 CACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAACCAACATGTCACAAACAAGCAG 2241
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAACCAACATGTCACAAACAAGCAG 180

Qy 2242 CATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAATATAATACTTTTAAAAAGAAA 2301
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 CATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAATATAATACTTTTAAAAAGAAA 240

Qy 2302 ATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCACAAAGAGAAATAGAATGTTTG 2361
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 ATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCACAAAGAGAAATAGAATGTTTG 300

Qy 2362 AAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCACATACCCTGTGAAGACAATAC 2421
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 AAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCACATACCCTGTGAAGACAATAC 360

Qy 2422 TATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTTTCACTA 2466
        |||||||||||||||||||||||||||||||||||| ||||||||
Db  361 TATCTACAATTTTTTCAGGATTATTAAAATCTTCTTCTTTCACTA 405



RESULT 13 
US-09-931-157-1 

; Sequence 1, Application US/09931157 

; Patent No. US20020082414A1 

; GENERAL INFORMATION: 

; APPLICANT: Imura, Hiroo 

; APPLICANT: Nakao, Kazuwa 

; APPLICANT: Nakanishi, Shigetada 

; TITLE OF INVENTION: Human Endothelin Receptor 

; FILE REFERENCE: 299002032411 

; CURRENT APPLICATION NUMBER: US/09/931,157

; CURRENT FILING DATE: 2001-10-15 

PRIOR APPLICATION NUMBER: 08/121,446 
; PRIOR FILING DATE: 1993-09-14 
; PRIOR APPLICATION NUMBER: 07/911,684 
; PRIOR FILING DATE: 1992-07-10 
; PRIOR APPLICATION NUMBER: JP 3-172828 
; PRIOR FILING DATE: 1991-07-12 
; NUMBER OF SEQ ID NOS: 2

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 1 

LENGTH: 4105 

TYPE: DNA 

ORGANISM: Homo Sapiens 
FEATURE:
NAME/KEY: CDS
LOCATION: (485)...(1768)
NAME/KEY: sig_peptide
LOCATION: (485)...(544)
NAME/KEY: mat_peptide
LOCATION: (545)...(1768)
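The feature locations above are inclusive, 1-based nucleotide spans, so the encoded peptide lengths follow from simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the CDS and mat_peptide spans ending at 1768 include the stop codon, an assumption that is consistent with the 427-residue ETa length quoted in the GeneSeq entry later in this report.

```python
def span_length(start, end):
    """Length in nucleotides of an inclusive, 1-based location start..end."""
    return end - start + 1


def residues(start, end, includes_stop=False):
    """Amino acids encoded by an in-frame span (stop codon optionally excluded)."""
    length = span_length(start, end)
    if length % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    codons = length // 3
    # Assumption: when includes_stop is True the last codon is the stop codon.
    return codons - 1 if includes_stop else codons


cds = residues(485, 1768, includes_stop=True)     # CDS (485)...(1768)
signal = residues(485, 544)                       # sig_peptide (485)...(544)
mature = residues(545, 1768, includes_stop=True)  # mat_peptide (545)...(1768)
print(cds, signal, mature)  # prints: 427 20 407
```

Under these assumptions the signal peptide (20 residues) and the mature peptide (407 residues) add up to the full 427-residue product.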
US-09-931-157-1 

Query Match 9.2%; Score 395.8; DB 9; Length 4105; 

Best Local Similarity 65.7%; Pred. No. 1.3e-69; 



Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3; 
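The summary figures for this hit can be cross-checked against one another. The sketch below is a hedged reconstruction, not the search program's documented method: the gap penalties (10.0 to open, 1.0 to extend) are those listed in the run parameters at the top of the report, while the mismatch penalty of 0.6 and the perfect score of 4301 (taken as the query length) are assumptions inferred from the reported numbers. All function names are hypothetical.

```python
def best_local_similarity(matches, mismatches, indels):
    """Percentage of alignment columns that are identical."""
    columns = matches + mismatches + indels
    return 100.0 * matches / columns


def sw_score(matches, mismatches, indels, gaps,
             mismatch_penalty=0.6,  # assumption inferred from reported scores
             gap_open=10.0,         # "Gapop 10.0" in the run parameters
             gap_extend=1.0):       # "Gapext 1.0" in the run parameters
    """Alignment score under an identity table with affine gap costs."""
    return (matches
            - mismatch_penalty * mismatches
            - gap_open * gaps
            - gap_extend * indels)


def query_match(score, perfect_score):
    """Score as a percentage of the query's self-alignment score."""
    return 100.0 * score / perfect_score


print(round(best_local_similarity(634, 307, 24), 1))  # 65.7
print(round(sw_score(634, 307, 24, 3), 1))            # 395.8
print(round(query_match(395.8, 4301), 1))             # 9.2 (4301 is an assumed perfect score)
```

The same assumed scoring also reproduces the 403.4 reported for the preceding 405-base hit (404 matches, 1 mismatch, no gaps).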



Qy    505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564
          |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db    689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748

Qy    565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624
            | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db    749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808

Qy    625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684
          || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db    809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868

Qy    685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 740
           |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db    869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928

Qy    741 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789
                      ||||| |      | || |||||| | || ||  | |||||  ||||
Db    929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988

Qy    790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849
          ||||| ||||| || || |  || || ||||| ||| ||||||| ||  |||| |||||
Db    989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048

Qy    850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909
          || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db   1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108

Qy    910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969
          || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db   1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168

Qy    970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029
           | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db   1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1222

Qy   1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089
          |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db   1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282

Qy   1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 1146
          || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db   1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342

Qy   1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206
          ||  |  || ||   |||||  | | ||| || || |||||| | || |||||||| |||
Db   1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402

Qy   1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266
          || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db   1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462

Qy   1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326
          ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db   1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522

Qy   1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386
           |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db   1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582

Qy   1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446
          || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db   1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642

Qy   1447 TGGTG 1451
          || ||
Db   1643 TGCTG 1647



RESULT 14 

US-10-007-926A-229
; Sequence 229, Application US/10007926A
; Publication No. US20030143539A1
; GENERAL INFORMATION:
; APPLICANT: BERTUCCI, FRANCOIS
; APPLICANT: HOULGATTE, REMI
; APPLICANT: BIRNBAUM, DANIEL
; APPLICANT: NGUYEN, CATHERINE
; APPLICANT: VIENS, PATRICE
; APPLICANT: FERT, VINCENT
; TITLE OF INVENTION: GENE EXPRESSION PROFILING OF PRIMARY BREAST CARCINOMAS
; TITLE OF INVENTION: USING ARRAYS OF CANDIDATE GENES
; FILE REFERENCE: 1546-R-00
; CURRENT APPLICATION NUMBER: US/10/007,926A
; CURRENT FILING DATE: 2001-12-07
; PRIOR APPLICATION NUMBER: 60/254,090
; PRIOR FILING DATE: 2000-12-08
; NUMBER OF SEQ ID NOS: 468
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 229
; LENGTH: 4105
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; OTHER INFORMATION: endothelin receptor type a (EDNRA) gene.
US-10-007-926A-229

Query Match 9.2%; Score 395.8; DB 13; Length 4105;
Best Local Similarity 65.7%; Pred. No. 1.3e-69;
Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3;



Qy    505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564
          |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db    689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748

Qy    565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624
            | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db    749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808

Qy    625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684
          || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db    809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868

Qy    685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 740
           |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db    869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928

Qy    741 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789
                      ||||| |      | || |||||| | || ||  | |||||  ||||
Db    929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988

Qy    790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849
          ||||| ||||| || || |  || || ||||| ||| ||||||| ||  |||| |||||
Db    989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048

Qy    850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909
          || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db   1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108

Qy    910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969
          || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db   1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168

Qy    970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029
           | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db   1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1222

Qy   1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089
          |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db   1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282

Qy   1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 1146
          || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db   1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342

Qy   1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206
          ||  |  || ||   |||||  | | ||| || || |||||| | || |||||||| |||
Db   1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402

Qy   1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266
          || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db   1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462

Qy   1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326
          ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db   1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522

Qy   1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386
           |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db   1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582

Qy   1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446
          || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db   1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642

Qy   1447 TGGTG 1451
          || ||
Db   1643 TGCTG 1647



RESULT 15 
US-10-101-510-370 

; Sequence 370, Application US/10101510 

; Publication No. US20030148295A1 

; GENERAL INFORMATION: 

; APPLICANT: WAN, JACKSON 

; APPLICANT: WANG, YIXIN 

; TITLE OF INVENTION: EXPRESSION PROFILES AND METHODS OF USE 
; FILE REFERENCE: 15117.0012 

; CURRENT APPLICATION NUMBER: US/10/101,510

; CURRENT FILING DATE: 2002-03-20 

; PRIOR APPLICATION NUMBER: 60/276,947 

; PRIOR FILING DATE: 2001-03-20 

; NUMBER OF SEQ ID NOS: 805

SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 370 

LENGTH: 4105 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-101-510-370 

Query Match 9.2%; Score 395.8; DB 13; Length 4105; 

Best Local Similarity 65.7%; Pred. No. 1.3e-69; 

Matches 634; Conservative 0; Mismatches 307; Indels 24; Gaps 3; 



Qy    505 TGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGC 564
          |||| |   |  |   | || |      ||||||||||||| ||||| ||  | || ||
Db    689 TGCCCACAGCAGACTAAAATTACTTCAGCTTTCAAATACATTAACACTGTGATATCTTGT 748

Qy    565 CTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAAC 624
            | | ||| |  |||| ||  | |||||  | || || || || || || ||| |||||
Db    749 ACTATTTTCATCGTGGGAATGGTGGGGAATGCAACTCTGCTCAGGATCATTTACCAGAAC 808

Qy    625 AAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTG 684
          || || ||| | || || |||||     |||| |||||  | || || ||||||||  |
Db    809 AAATGTATGAGGAATGGCCCCAACGCGCTGATAGCCAGTCTTGCCCTTGGAGACCTTATC 868

Qy    685 CACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC---- 740
           |  | ||||||||  ||||||||||||| |  ||||||||||| | |  ||||||
Db    869 TATGTGGTCATTGATCTCCCTATCAATGTATTTAAGCTGCTGGCTGGGCGCTGGCCTTTT 928

Qy    741 -----------ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789
                      ||||| |      | || |||||| | || ||  | |||||  ||||
Db    929 GATCACAATGACTTTGGCGTATTTCTTTGCAAGCTGTTCCCCTTTTTGCAGAAGTCCTCG 988

Qy    790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849
          ||||| ||||| || || |  || || ||||| ||| ||||||| ||  |||| |||||
Db    989 GTGGGGATCACCGTCCTCAACCTCTGCGCTCTTAGTGTTGACAGGTACAGAGCAGTTGCC 1048

Qy    850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909
          || |||||| |  || | ||||||||| ||||       || ||  | |||||||| |
Db   1049 TCCTGGAGTCGTGTTCAGGGAATTGGGATTCCTTTGGTAACTGCCATTGAAATTGTCTCC 1108

Qy    910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969
          || ||| |  | ||  |  | |||||  | |||||||| || || || |  ||  |  |
Db   1109 ATCTGGATCCTGTCCTTTATCCTGGCCATTCCTGAAGCGATTGGCTTCGTCATGGTACCC 1168

Qy    970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029
           | || || |  ||     | |    || |||  ||||  || ||      |   ||
Db   1169 TTTGAATATAGGGGTGAACAGCATAAAACCTGTATGCTCAATGCC------ACATCAAAA 1222

Qy   1030 TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089
          |||||| |||| ||| |    | ||| || |||||||| ||| | |||||||||||  ||
Db   1223 TTCATGGAGTTCTACCAAGATGTAAAGGACTGGTGGCTCTTCGGGTTCTATTTCTGTATG 1282

Qy   1090 CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG---AGAAAG 1146
          || ||||    ||||||  | || || || || ||||| ||||| ||||||   |||| |
Db   1283 CCCTTGGTGTGCACTGCGATCTTCTACACCCTCATGACTTGTGAGATGTTGAACAGAAGG 1342

Qy   1147 AAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAA 1206
          ||  |  || ||   |||||  | | ||| || || |||||| | || |||||||| |||
Db   1343 AATGGCAGCTTGAGAATTGCCCTCAGTGAACATCTTAAGCAGCGTCGAGAAGTGGCAAAA 1402

Qy   1207 ACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGG 1266
          || || || ||| ||||  |  | ||||| || |||||| | || |||||| | ||| |
Db   1403 ACAGTTTTCTGCTTGGTTGTAATTTTTGCTCTTTGCTGGTTCCCTCTTCACTTAAGCCGT 1462

Qy   1267 ATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTT 1326
          ||  |||||   ||| | |||||  | |  ||    ||  |||||||| |  | || ||
Db   1463 ATATTGAAGAAAACTGTGTATAACGAAATGGACAAGAACCGATGTGAATTACTTAGTTTC 1522

Qy   1327 CTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCA 1386
           |  || |  |||| || || ||||| ||| ||||  |  ||||||| || || |||||
Db   1523 TTACTGCTCATGGATTACATCGGTATTAACTTGGCAACCATGAATTCATGTATAAACCCC 1582

Qy   1387 ATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGC 1446
          || ||||||||||| |||||||| | ||| ||||| || ||  |||||||| | ||||||
Db   1583 ATAGCTCTGTATTTTGTGAGCAAGAAATTTAAAAATTGTTTCCAGTCATGCCTCTGCTGC 1642

Qy   1447 TGGTG 1451
          || ||
Db   1643 TGCTG 1647
1: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1980.DAT:*
2: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1981.DAT:*
3: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1982.DAT:*
4: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1983.DAT:*
5: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1984.DAT:*
6: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1985.DAT:*
7: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1986.DAT:*
8: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1987.DAT:*
9: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1988.DAT:*
10: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1989.DAT:*
11: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1990.DAT:*
12: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1991.DAT:*
13: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1992.DAT:*
14: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1993.DAT:*
15: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1994.DAT:*
16: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1995.DAT:*
17: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1996.DAT:*
18: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1997.DAT:*
19: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1998.DAT:*
20: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1999.DAT:*
21: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2000.DAT:*
22: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001A.DAT:*
23: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001B.DAT:*
24: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2002.DAT:*
25: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2003.DAT:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.
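The report does not say how the score distribution is analysed. As a rough illustration only, the sketch below uses one common approach for local-alignment scores: fit an extreme-value (Gumbel) distribution to background scores by the method of moments, then multiply the tail probability by the number of hits considered. The function names, the fitting method, and the sample numbers are all assumptions and are not taken from the search software.

```python
import math
from statistics import mean, pstdev

EULER_GAMMA = 0.5772156649015329


def fit_gumbel(scores):
    """Method-of-moments estimate of Gumbel location (mu) and scale (beta)."""
    beta = pstdev(scores) * math.sqrt(6) / math.pi
    if beta == 0:
        raise ValueError("background scores have no spread")
    mu = mean(scores) - EULER_GAMMA * beta
    return mu, beta


def predicted_number(score, background_scores, n_hits):
    """Expected count of chance hits scoring at least `score` (illustrative)."""
    mu, beta = fit_gumbel(background_scores)
    z = (score - mu) / beta
    if z < -700:  # tail probability is effectively 1; avoids overflow in exp
        return float(n_hits)
    tail = -math.expm1(-math.exp(-z))  # P(S >= score) under the fitted Gumbel
    return n_hits * tail


# Hypothetical background scores and hit count, for illustration only.
background = [float(s) for s in range(20, 60)]
print(predicted_number(45.0, background, 1000) > predicted_number(400.0, background, 1000))
```

With any such model, a score far above the bulk of the distribution yields a predicted number close to zero, which is how the very small Pred. No. values in the listings should be read.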
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AAF21283 
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18 


1248 


29. 


.0 


1578 


25 
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28 . 
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.2 
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.6 
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24 


ABI99321 
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.4 
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13 


AAQ25892 
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AAQ53922 
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763 , 


.2 
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12 , 
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24 


ABK94409 


DNA encoding endot 


26 


440, 


.6 


10, 


.2 
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24 
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27 
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.6 


10. 


.2 


592 


24 


ABL63647 
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28 


440. 
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.2 
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24 


ABL64653 
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15 


AAQ63209 
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30 


395. 


.8 


9. 


.2 
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21 


AAF20903 


Human low adenosin 
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.2 
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AAF20915 
33   395.8   9.2    1868  21  AAA34793   Human adenosine re
34   395.8   9.2    2008  21  AAF20904   Human low adenosin
35   395.8   9.2    2008  21  AAA34782   Human adenosine re
36   395.8   9.2    4079  25  ACA56659   Human signalling p
37   395.8   9.2    4105  14  AAQ34583   ETa receptor gene.
38   395.8   9.2    4105  24  ABZ35259   Human gene express
39   395.8   9.2    4105  24  ABV94238   Breast carcinoma r
40   395.8   9.2    4105  25  ABZ42662   Human endothelin A
41   395.8   9.2    5036  21  AAF21447   Human endothelin r
42   395.8   9.2  117609  21  AAF21435   Human receptor-rel
43   395.4   9.2    4105  21  AAA38341   Human endothelin r
44   394.2   9.2    1310  21  AAF20902   Human endothelin r
45   394.2   9.2    1310  21  AAF20914   Human ELAM-1 polyn



ALIGNMENTS 



RESULT 1 
AAQ34584 



ID AAQ34584 standard; DNA; 4301 BP. 
XX 

AC AAQ34584; 
XX 

DT 25-MAR-2003 (updated) 

DT 11-MAY-1993 (first entry)

XX 

DE ETb receptor gene. 
XX 

KW Human; ETa; ETb; endothelin; receptor; transmembrane domain; N tail;
KW extracellular; cytoplasmic; C tail; post translational; bovine;
KW modification; ET-1 receptor; antagonist; circulatory system; ss.
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers
FT CDS 238..1566
FT /*tag= a
FT sig_peptide 238..315
FT /*tag= b
FT mat_peptide 316..1563
FT /*tag= c
FT polyA_signal 2595..2600
FT /*tag= d
FT polyA_signal 3134..3139
FT /*tag= e
FT polyA_signal 3638..3643
FT /*tag= f
FT polyA_signal 4101..4106
FT /*tag= g
FT polyA_signal 4258..4263
FT /*tag= h
FT misc_feature 1909..1913
FT /*tag= i
FT /function= Related with mRNA instability
FT misc_feature 1997..2001
FT /*tag= j
FT /function= Related with mRNA instability
FT misc_feature 2119..2123
FT /*tag= k
FT /function= Related with mRNA instability
FT misc_feature 2273..2277
FT /*tag= l
FT /function= Related with mRNA instability
FT misc_feature 2745..2749
FT /*tag= m
FT /function= Related with mRNA instability
FT misc_feature 3346..3350
FT /*tag= n
FT /function= Related with mRNA instability
FT misc_feature 3484..3488
FT /*tag= o
FT /function= Related with mRNA instability
FT misc_feature 3495..3499
FT /*tag= p
FT /function= Related with mRNA instability
FT misc_feature 3632..3636
FT /*tag= q
FT /function= Related with mRNA instability
FT misc_feature 3852..3856
FT /*tag= r
FT /function= Related with mRNA instability
FT misc_feature 4108..4112
FT /*tag= s
FT /function= Related with mRNA instability
FT misc_feature 4213..4217
FT /*tag= t
FT /function= Related with mRNA instability
XX
PN EP522868-A1.
XX
PD 13-JAN-1993.
XX
PF 10-JUL-1992; 92EP-0306347.
XX
PR 12-JUL-1991; 91JP-0172828.
XX
PA (SHIO ) SHIONOGI SEIYAKU KK.
XX
PI Imura H, Nakanishi S, Nakao K;
XX
DR WPI; 1993-010677/02.
DR P-PSDB; AAR30886.
XX
PT Human ETa and ETb endothelin receptors - for measuring endothelin
PT and screening for endothelin antagonists
XX
PS Claim 12; Fig 2; 39pp; English.
XX
CC The sequences given in AAQ34583-84 encode the human ETa and ETb
CC endothelin receptors respectively. ETa is a 427 amino acid protein
CC with a molecular weight of 48,726. ETb comprises 442 amino acids and
CC has a molecular weight of 49,629. ETa has a higher affinity for
CC endothelin (ET)-1 and ET-2, whereas ETb has no selectivity for ET-1,
CC ET-2 or ET-3. The receptors each contain seven transmembrane domains
CC and have an extracellular N tail and a cytoplasmic C tail. There are
CC several potential sites for post translational modification, these
CC sites are identical to those of bovine ET-1 receptor. ETa cDNA is
CC 91.2% homologous to bovine ET-1 receptor cDNA and ETb cDNA is 61.1%
CC homologous to that of bovine ETa-receptor. The receptor proteins are
CC useful as reagents for measuring the amount of ET or screening for
CC antagonists of the ET receptor when studying the circulatory system.
CC (Updated on 25-MAR-2003 to correct PN field.)
XX
SQ Sequence 4301 BP; 1342 A; 830 C; 815 G; 1314 T; 0 other;

Query Match 99.9%; Score 4297.8; DB 14; Length 4301;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 4299; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy 1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC



Db 


1 


Qy 


' 61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 



1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 N 1 1 1 ! 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 !l I Ml 1 1 1 M 

GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 6 0 
AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120 

MIMM II IM IMMI Ml Ml II II II IM IM IM 

AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120 
AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180 

M II II II Ml II II II Ml II II M! IMMMMM 

AGGATCAACAOVGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180 
AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240 

IIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIMIIMIIIIMIIIIMIIII 

AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 24 0 
CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 3 00 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 3 00 
TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360 

I M 1 1 1 1 1 1 1 1 I II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 III I II 1 1 II 

TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360 
ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420 

IMIMIIIIIIIIIIIIIMMIIIIIIIIIIIIMIIIIIIIMIIIMIIIIIIIII 

ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420 
CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 4 80 

IIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIMIIMI 

CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 4 80 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 

I! II Mi II M' i'l II I llllllllllllllllllllllll 

TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTCCTGGGGATCATCGGGAACTCCACA 600 

CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 

MIIIIIIIIIMIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIII 

CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 

AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720 

M MMMM II II Ml II II II II II IMMM II I 

AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 72 0 

CTG CTGGCAGAGGACTGG C CATTTGGAG CTGAGATGTGTAAG CTGGTG CCTTTCATACAG 78 0 

IIIIIIIIIIIMIIIIIIIIIIIMIIMIIIMMIMMIMIIIMIIMIIIIII 

CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780 

AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 84 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 r 1 1 r iiiiiiiiiiiiiiiiiiiiimiiiiimiiiim 

AAAGCCTCCGTGGGAATCACTGTCCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 84 0 

GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900 

MIIIIMIMIIMIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIMIIIMI 



Db 



841 G CTGTTG CTTCTTGGAGTAGAATTAAAGGAATTGGGGTT CCAAAATGGACAGCAGTAGAA 900 



Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 




lJOl 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 



ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 

IIIIMIIMIIIIMMIIIMIIIIIMIIIEIIIIIIIIIIIII IMIIMIIIII 

ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 

ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020 

IIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIMIIIIIIIIIIMI 

ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 102 0 

AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080 

I II I M II MM || | IMMM Ml II II MM! M 

AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 108 0 

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 

MIIIIIIIIIIIIIIIIIIIMIIIMIIIIIMIIIIIIIIIIIIIIIIIIIMIMI 

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 

AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200 

IIIIIIIIIIIIIMIIIIIIMIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 12 60 

I MM IMM I Mill M II II M IMM MMM MM 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 

AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320 

MMM IMM III Ml IMM II MMM II MMMM III III 

AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 

II IM Mil II I II II I IIIMM MMI IM I IMM II 

AACCCT^TTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

MIMM I II III I III II II II I II II II MMM III 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 15 00 

AAGTTCAAAGCTAATGATC^CGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 

M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 

AAGTTC^WVGCTAATGATC^CGGATATGACAACTTCCGTTCa\GTAATAAATACAGCTCA 1560 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620 

M M I II M M ill II I Ml Mi I M; IM I IMM II 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620 



M M II II M II II : IM I M IM I MMI M 



TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 174 0 

M IMMMI II MMMM I IM IMM IM I IMMI MM 

TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 1740 



Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 



TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 

IMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMII 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 
TTTTTACAGTTAGCACTTCAACATAGC 1860 

! ; I : ; , I ,;i I I, .1 i; II 

TTTTTACAGTTAGCACTTCAACATAGCTCT 1860 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920 

1 1 ^ 1 1 ! 1 1 - 1 1 II I i 1 1 1 II I ! 1 1 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 
AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 

I I! IM IM I MM M II M II III MMM MMI Mill 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 198 0 
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 

I I II Ml I !M I MM II IM II I Ml II III II I 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 
TATCACACTATTATCAGATTGTAATTAGATG CAAATGAGAGAG CAGTTTAGTTGTTG CAT 2100 

MM II Ml I II M MM IIMIIMI M II I IMMI II 

TAT CACACTATTAT CAGATTGTAATTAGATG CAAATGAGAGAGCAGTTTAGTTGTTG CAT 2100 
TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAG CAAGG CTGT 2160 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIMIIIM 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 
TTTTGAAAATC^TTAC^CTTTC^CTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 222 0 

M IMM IMMMII I II Ml II M II III IMM MM 

TTTTGAAAATCATTACACTTTCACTAGAAGCCC^W^CCTCAGCATTCTGCAATATGTAAC 222 0 
CAACATGTCACAAkCAAGCA^ 228 0 

I II IMMI I IM MMM II II IMM Ml Ml II Ml I 

CAACATGTCACAAACAAGCAGCATC 2280 
TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 

IMM II II M II M III MMM II II II II II MM 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 
CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

II IMM IM I Ml I II II II II I, Ml MMM III I 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTC^TTCA 24 00 
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460 

I II IMMI I III MM! Ml M IIMMMI MMM Ml M 

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460 
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2 52 0 

M M M IM I II I II II I M I MM MMM III I 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 
CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580 

IIIMIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIII 

CTGCATGTAGATGATTAAATGAGGG CAGGCC CTGTG CTCATAG CTTTACGATGGAGAGAT 2 58 0 



Qy 


2581 


Db 


2581 


Qy 


2641 
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2641 


Qy 


2701 


Db 


2701 


Qy 


2761 
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2761 


Qy 
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2821 


Qy 


2881 
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Qy 


2941 
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Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 



GCC^GTGACCTC^TAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGAC^AAGGG 264 0 

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2581 GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640

Qy 2641 GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2641 GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700

Qy 2701 GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2701 GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760

Qy 2761 TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2761 TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820

Qy 2821 ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2821 ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880

Qy 2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940

Qy 2941 GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2941 GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000

Qy 3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060

Qy 3061 CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGC... 3120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3061 CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTC... 3120

Qy 3121 GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3121 GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180

Qy 3181 ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3181 ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240

Qy 3241 TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAA... 3300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3241 TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAA... 3300

Qy 3301 GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3301 GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360

Qy 3361 ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3361 ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420
Qy 3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480

Qy 3481 TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3481 TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540

Qy 3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600

Qy 3601 TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3601 TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660

Qy 3661 TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3661 TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720

Qy 3721 TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3721 TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780

Qy 3781 TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3781 TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3840

Qy 3841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900

Qy 3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960

Qy 3961 GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3961 GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020

Qy 4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080

Qy 4081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 4140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 4140

Qy 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200

Qy 4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260

Qy 4261 AAAATGCCACATTTCTGGTCTCTGGGAAAAAAAAAAAAAAA 4301
        |||||||||||||||||||||||||||||||||||||||||
Db 4261 AAAATGCCACATTTCTGGTCTCTGGGAAAAAAAAAAAAAAA 4301



RESULT 2 
AAF21284 

ID AAF21284 standard; DNA; 4286 BP. 
XX 

AC AAF21284; 
XX 

DT 14-MAR-2001 (first entry) 
XX 

DE Human low adenosine antisense oligonucleotide related sequence #2851. 
XX 

KW Low adenosine antisense oligonucleotide; phosphorothioate; allergy; 

KW human; airway disorder; bronchoconstriction; lung inflammation; 

KW surfactant depletion; respiratory; bronchodilator; antiinflammatory;

KW immunosuppressive; antiasthmatic; analgesic; hypotensive; cytostatic; 

KW respiratory obstruction; pulmonary obstruction; impeded respiration; 

KW surfactant hypoproduction; pulmonary vasoconstriction; asthma; RDS; 

KW respiratory distress syndrome; pain; cystic fibrosis; allergic rhinitis; 

KW pulmonary hypertension; emphysema; pulmonary transplantation rejection; 

KW chronic obstructive pulmonary disease; pulmonary infection; bronchitis; 

KW cancer; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200062736-A2.
XX 

PD 26-OCT-2000. 
XX 

PF 24-MAR-2000; 2000WO-US08020.
XX 

PR 06-APR-1999; 99US-0127958.
XX 

PA (UYEC-) UNIV EAST CAROLINA. 

PA (NYCE/) NYCE J W. 

XX 

PI Nyce JW; 
XX 

DR WPI; 2000-679539/66. 
XX 

PT Low adenosine (A) content antisense oligonucleotides which do not 

PT trigger adenosine receptors during metabolism, useful e.g. for treating 

PT cancers and respiratory obstructions - 

XX 

PS Disclosure; Page 1273-1274; 1592pp; English.
XX 

CC The present invention describes low adenosine (A) content antisense 

CC oligonucleotides and compositions (I) comprising them. In the antisense 

CC oligonucleotides the A is replaced by a 'Universal' or alternative base. 

CC (I) can have respiratory, bronchodilator, antiinflammatory, analgesic, 

CC immunosuppressive, antiasthmatic, hypotensive and cytostatic activities. 

CC The antisense oligonucleotides and (I) can be used to down-regulate the 

CC expression and or activity of target polypeptides associated with 

CC lung/respiratory disorders and malignancies, such as stimulating and 

CC activating peptide factors and transmitters, transcription factors, 

CC immunoglobulins and antibodies, antibody receptors, cytokines and 



CC chemokines, endogenously produced specific and non-specific enzymes, 

CC binding proteins, adhesion molecules and their receptors, cytokine and 

CC chemokine receptors, adenosine receptors, bradykinin receptors, central 

CC nervous system (CNS) and peripheral nervous and non-nervous system 

CC receptors, CNS and peripheral nervous and non-nervous system peptide 

CC transmitters, defensins, growth factors, vasoactive peptides and 

CC receptors, binding proteins and malignancy associated proteins. The 

CC antisense oligonucleotides may be used in this way to treat disorders 

CC including respiratory obstruction (especially pulmonary obstruction 

CC and/or bronchoconstriction) and/or lung inflammation, allergy(ies)

CC and/or surfactant hypoproduction which are associated with a disease or 

CC condition selected from pulmonary vasoconstriction, inflammation, 

CC allergies, asthma, impeded respiration, respiratory distress syndrome

CC (RDS), pain, cystic fibrosis (CF), allergic rhinitis (AR), pulmonary

CC hypertension, emphysema, chronic obstructive pulmonary disease (COPD),

CC pulmonary transplantation rejection, pulmonary infections, bronchitis, 

CC and/or cancer. AAF18434 to AAF21543 represent human polynucleotide 

CC fragments and antisense oligonucleotides used in the exemplification of 

CC the present invention. 
XX 

SQ Sequence 4286 BP; 1327 A; 829 C; 816 G; 1314 T; 0 other; 

Query Match 99.6%; Score 4284.4; DB 21; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy    1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy   61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy  121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy  181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
        |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db  181 AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Qy  241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy  301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy  361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

Qy  421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480



Qy  481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

Qy  541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

Qy  601
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601

Qy  661
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661

Qy  721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

Qy  781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

Qy  841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

Qy  901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960

Qy  961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020

Qy 1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080

Qy 1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140

Qy 1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200

Qy 1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260

Qy 1261
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261



Qy 1321 AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380

Qy 1381 AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440

Qy 1441 TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500

Qy 1501 AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560

Qy 1561 TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620

Qy 1621 AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCA... 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCA... 1680

Qy 1681 TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740

Qy 1741 TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800

Qy 1801 TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGAT... 1860
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1801 TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGAT... 1860

Qy 1861 TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1861 TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920

Qy 1921 AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1921 AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980

Qy 1981 AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1981 AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040

Qy 2041 TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2041 TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100

Qy 2101 TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2101 TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160



Qy 2161 TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2161 TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220

Qy 2221 CAACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAA 2280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2221 CAACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAA 2280

Qy 2281 TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 2340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2281 TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 2340

Qy 2341 CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 2400
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2341 CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 2400

Qy 2401 CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2401 CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460

Qy 2461 TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2461 TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520

Qy 2521 CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2521 CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580

Qy 2581 GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2581 GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640

Qy 2641 GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2641 GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700

Qy 2701 GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2701 GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760

Qy 2761 TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2761 TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820

Qy 2821 ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2821 ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880

Qy 2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940

Qy 2941 GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2941 GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000
Qy 3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060

Qy 3061 CGTCACATCAATGCAAAAGGTCCTGATTTT... 3120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3061 CGTCACATCAATGCAAAAGGTCCTGATTTT... 3120

Qy 3121 GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3121 GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180

Qy 3181 ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3181 ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240

Qy 3241
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3241

Qy 3301 GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3301 GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360

Qy 3361 ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3361 ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420

Qy 3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480

Qy 3481 TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3481 TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540

Qy 3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600

Qy 3601 TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3601 TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660

Qy 3661 TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3661 TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720

Qy 3721
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3721

Qy 3781
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3781

Qy 3841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900



Qy 3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960

Qy 3961 GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3961 GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020

Qy 4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080

Qy 4081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 4140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 4140

Qy 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200

Qy 4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260

Qy 4261 AAAATGCCACATTTCTGGTCTCTGGG 4286
        ||||||||||||||||||||||||||
Db 4261 AAAATGCCACATTTCTGGTCTCTGGG 4286



RESULT 3 
AAA35162 

ID AAA35162 standard; DNA; 4286 BP. 
XX 

AC AAA35162; 
XX 

DT 28-JUL-2000 (first entry) 
XX 

DE Human adenosine receptor related polynucleotide 2nd SEQ ID NO:36. 
XX 

KW Human; adenosine receptor; low adenosine antisense oligonucleotide; 

KW phosphorothioate; impaired respiration; inflammation; allergy; 

KW allergic disease; bronchoconstriction; inhibitor; antiinflammatory; 

KW antiallergic; antiasthmatic; cytostatic; analgesic; impaired airway; 

KW lung disease; ischaemic condition; pulmonary vasoconstriction; asthma; 

KW respiratory distress syndrome; pain; cystic fibrosis; emphysema; 

KW pulmonary hypertension; chronic obstructive pulmonary disease; COPD; 

KW cancer; leukaemia; lymphoma; carcinoma; metastasis; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200009525-A2.
XX 

PD 24-FEB-2000. 
XX 

PF 03-AUG-1999; 99WO-US17712.
XX 



PR 03-AUG-1998; 98US-0095212.
XX 

PA (UYEC-) UNIV EAST CAROLINA. 
XX 

PI Nyce JW; 
XX 

DR WPI; 2000-205971/18. 
XX 

PT New antisense oligonucleotides useful for treating e.g. pulmonary 

PT vasoconstriction, inflammation, allergies, asthma, hypertension, 

PT bronchitis, emphysema, respiratory distress syndrome, ischemia or 

PT cancers 
XX 

PS Disclosure; Page 1191-1192; 1343pp; English. 
XX 

CC The present invention describes a new composition comprising an 

CC antisense oligonucleotide (ON) with low adenosine (up to 15%), which

CC targets nucleic acids involved in bronchoconstriction, allergies, and/or

CC inflammation. The ON can have antiinflammatory, antiallergic, 

CC antiasthmatic, cytostatic and analgesic activities. The compositions are 

CC useful for the treatment of diseases associated with inflammation, 

CC impaired airways, including lung disease and diseases whose secondary 

CC effects afflict the lungs of a subject. They can be used for treating 

CC e.g. ischaemic conditions, pulmonary vasoconstriction, allergies, 

CC asthma, impeded respiration, respiratory distress syndrome, pain, cystic 

CC fibrosis, pulmonary hypertension, emphysema, chronic obstructive 

CC pulmonary disease (COPD), and cancers such as leukaemias, lymphomas,

CC carcinomas, and cancers which may metastasise to the lungs, including 

CC breast and prostate cancer. The reduction of the adenosine content of 

CC the ONs reduces side effects. The A-containing ONs break down with the 

CC release of deoxyadenosine which activates adenosine receptors causing 

CC bronchoconstriction and inflammation. AAA32313 to AAA35312 represent the 

CC nucleotide sequences given in the sequence listing from the present 

CC invention, which correspond to SEQ ID NO:1 to 2815, and then the last

CC 185 sequences are also called SEQ ID NO:1 to 185, but the sequences

CC differ from the previously named sequences. SEQ ID NO:11 to 1680

CC (AAA32323 to AAA33992) are specifically claimed ONs from the present 

CC invention. N.B. Sequences given in the disclosure of the present 

CC invention do not match up with their corresponding SEQ ID NO: sequences 

CC given in the sequence listing. 

XX 

SQ Sequence 4286 BP; 1327 A; 829 C; 816 G; 1314 T; 0 other; 



Query Match 99.6%; Score 4284.4; DB 21; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy    1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy   61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy  121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy  181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
        |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db  181 AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Qy  241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy  301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy  361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

Qy  421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

Qy  481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

Qy  541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

Qy  601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

Qy  661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

Qy  721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

Qy  781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

Qy  841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

Qy  901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960

Qy  961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020



Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 



AAGACAGCTTTCATGCAGTTTTA(^GACAGC^AAAGATTGGTGGCTGTTCAGTTTCTAT 1080 

IIIIIMIIIIIIIIIMIIMIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIM 

AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080 



IIIMIMIIIIIIIMIIIIMIIIIIIIIIIIIMIIIIIIIIIIMIIIMIIIMI 



AGAAAGAAAAGTGG CATGCAGATTG CTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 12 00 

IIIMIIIIIIIIIIIIMMIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIII 

AGAAAGAAAAGTGGCATGCAGATTG CTTTAAATGAT CACCTAAAGCAGAGACGGGAAGTG 12 00 
GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 12 60 

IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMII 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 12 60 
AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 132 0 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIII 

AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 132 0 
AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 

I I M !l' II I II I: II, I IMIMI II I II ,1 ,! 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 138 0 



M I II ! II I IM I II II Ml II M II M II II Ml 



Mill MM III MM III MM III lllllll IMIMI III MM IMIMI III II 



AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 

MMIMMIMI MIMI ll III I II II MIMM MUM I II 

AAGTTCAAAGCTAATGATCACGGATATGAGAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 
TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620
AACAAAATGAAACATTTGCOW\ACAAAACAAAAAAC 168 0 

M MM MMMM Ml M III! I MIMI Ml II MIMI M III 

AACAAAATGAAACATTTGCCAAAAGAA 168 0 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740
TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800
TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860 

J i ) i 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 r 1 1 1 1 j 1 1 1 1 1 1 1 ! 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 

TTTTTAC^GTTAGC^CTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860 



Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy


2581


Db 


2581 


Qy 


2641 


Db 


2641 



TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920
AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040
TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100
TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 
TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTC^ 222 0 

' 1 1 1 1 1 II ! ' 1 1 I 1 1 ! M . 

TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220 
CAAC^TGTCACAAACAAGCAGCATGTAACAGACTGGCACA 2280 

I M II UNI Ml I' II IIMI II II II Ml IIUI 'I 

CAACATGTCACAAACAAGCAGCATGTAAC^GAC 2280 
TATAATACTTTTAAAAAGAAAATTATTACTVTCCTTTAC^ 234 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 
CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 2400

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 2400
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520
CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580
GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640
GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700



Qy 



2 701 GCTATAGTTAAAATACTATTTTTC^AAATC^TACAGATTAGTACATTTAACAGCTACCTG 2760 



Ill II I III III I II II I II! Ill II II III II I MINI IMIIIMM III III II I 

Db 2701 GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760

Qy 2761 TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2761 TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820

Qy 2821 ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2821 ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880

Qy 2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940

Qy 2941 GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2941 GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000

Qy 3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060

Qy 3 061 CGTC^C^TCAATGCAAAAGGTCCTGATTTTGTTC 3120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 3 061 CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAAC^CAGTGCAATGTTCTCA 3120 

Qy 3121 GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 

MM MM M I M M II M I Mill II IMM I I I 

Db 3121 GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 

Qy 3181 ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3181 ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240

Qy 3241 TTGTTTTCTGTCAATATTGAATGTC^ 3300 

M !l I M I I II II I! I M'M M IMM II il Ml I 

Db 3241 TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300 

Qy 3301 GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 

M II I M I M: II II M I I ! M II II II M MM 

Db 3301 GAAAGAAAGAGC^ATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 

Qy 3361 ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3361 ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420

Qy 3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480

Qy 3481 TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3481 TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540

Qy 3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 

M MMMI M IMM II MM M I MIMI M M I MM 



Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 


Db 


4021 


Qy 


4081 


Db 


4081 


Qy 


4141 


Db 


4141 






Db 


4201 


Qy 


4261 


Db 


4261 



3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 
TGAAACTAC^CACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 

II M 1 1 1 1 1 1 i M Ml 1 1 1 1 1 1 1 II 1 1 1 i 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 

TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720

TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 378 0 

I !h I! i MM. 11.11 I III II 1 I III II I HI II 

TTCTTTACATACTCAuAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 378 0 



IM II M II :MMIMI MMI M M Ml II M Ml II 



CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900 

I IMM:MI MMMIIIMI IMIIMI III MMI II MIMIMI 

CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 
GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960
GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020
AAAAAATTATATATCTGGGAGGATTTTTTGGTTGC CTAAAGTGG CTATAGTTACTGATTT 4 080 

MMM MINI ! II II M MM Nil IMM II UN 

AAAAAATTATATATCTGGGAGGATTTTTTGGTTG CCTAAAGTGG CTATAGTTACTGATTT 4 08 0 



II II I M IMM.MIMI I Ml M I INNNM It M 



ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200
CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATC 4260 

, i I i: ! I - ;l ! I : i M 

CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAA 4260 
AAAATGCCACATTTCTGGTCTCTGGG 4286 

||||||||||||||||||||||||||

AAAATGCCACATTTCTGGTCTCTGGG 4286 



RESULT 4 
ABV94186

ID ABV94186 standard; cDNA; 4286 BP. 
XX 

AC ABV94186; 
XX 



DT 08-JAN-2003 (first entry) 
XX 

DE Breast carcinoma related nucleotide sequence SEQ ID NO: 177. 
XX 

KW Human; breast carcinoma; cancer; tumour; cytostatic; anti-tumour; 

KW gene; ss. 

XX 

OS Homo sapiens.
XX 

PN WO200246467-A2.
XX 

PD 13-JUN-2002. 
XX 

PF 07-DEC-2001; 2001WO-IB02811.
XX 

PR 08-DEC-2000; 2000US-254090P.

PR 07-DEC-2001; 2001US-0007926.
XX 

PA (IPSO-) IPSOGEN. 
XX 

PI Bertucci F, Houlgatte R, Birnbaum D, Nguyen C, Viens P, Fert V; 
XX 

DR WPI; 2002-619023/66. 
XX 

PT Novel polynucleotide library useful in molecular characterization of a 

PT carcinoma, comprising a pool of polynucleotide sequences or its 

PT subsequences which are either underexpressed or overexpressed in tumor 

PT cells 

XX 

PS Claim 1; Page 225-226; 401pp; English.
XX 

CC The present invention describes a polynucleotide library (I) useful in 

CC the molecular characterisation of a carcinoma, comprising a pool of 

CC polynucleotides or its subsequences which are either underexpressed or 

CC overexpressed in tumour cells, and correspond to any of the 

CC polynucleotide sequences chosen from the 468 sequences given in ABV94010

CC to ABV94477. Also described: (1) a polynucleotide array (II) useful for 

CC the prognosis or diagnostic of tumour, comprising (I); and (2) detecting 

CC (M1) differentially expressed polynucleotide sequences which are

CC correlated with a cancer, involves obtaining a polynucleotide sample from 

CC a patient, and reacting the polynucleotide sample obtained with a probe 

CC immobilised on a solid support, where the probe comprises any combination 

CC of the polynucleotide sequences of (I) or its expression products encoded 

CC by polynucleotide sequences of (I), and detecting the reaction product. 

CC (I) have cytostatic activities and can be used as anti-tumour agents. (I) 

CC is useful in molecular characterisation of a carcinoma. (I) and (II) are 

CC useful for the prognosis or diagnostic of tumour, in differentiating a 

CC normal cell from a cancer cell, detecting a hormone sensitive tumour 

CC cell, differentiating a tumour with lymph nodes from a tumour without 

CC lymph nodes, differentiating antracycline-sensitive tumours from

CC antracycline-insensitive tumours, and classifying good and poor prognosis

CC primary breast tumours. (I) is useful for large-scale molecular 

CC characterisation of breast cancer that help in prediction, prognosis and 

CC cancer treatment, and for detecting differentially expressed genes that 

CC correlated with a cancer. 

XX 

SQ Sequence 4286 BP; 1327 A; 829 C; 816 G; 1314 T; 0 Other; 



Query Match 99.6%; Score 4284.4; DB 24; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy 


l 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 



GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Ml INI IIIUMI 1 1 II 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 i II! II! II 1 1 ! M I 

AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300 

I IMIIMI Mill Mill M MMI M IMMMIMIIMIMI 

CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300 
TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360 

IMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIMIIIIIIIIIIIIII 

TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360 
ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420
CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 4 80 

MIMIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIMMIIIIIIMIIIII 

CTGGCGCGGf CGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 4 80 
CCACGC^CC^TCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 54 0 

IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIII 

CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 54 0 
TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIII 

TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 
CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 

IMIMIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMII 

CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 
AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720
CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780 

IIIMIIIMIMIIIIIIIIIIIIIIIIIIMIMIIMIIIIIIIIMIMIIIIIII 

CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780 



Qy 


781 


AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA


840 






MIMM MMIIIMI MIIMIIIIIMI IIMIIIMI IIIIIIIIIIIIM 




Db 


781 


AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 


840 


Qy 


841 


GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA


900 


Db 


841 


1 M 1 1 1 1 1 1 1 II 1 II 1 Ml 1 1 1 III Ml II Mi 1 1 M 1 II II M 1 II 1 1 1 1 1 1 1 1 1 M 

GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA


900 


Qy 


901 


ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 


960 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


901 


ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 



960 


Qy 


961 


ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG


1020 






II II MM MM III MM III INI MM llllll! MM III Ml! MM MM III 




Db 


961 


ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG


1020


Qy 


1021 


AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 


1080 






M 1 II II 1 1 1! 1 II M IE 1 II II 1 1 II 1 1 1 II II 1 1 I 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 




Db 


1021 


AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT


1080


Qy 


1081 


TTCTGCTTGCC^TTGGCa\TCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 


1140 






II 1 1 II II 1 1 M II 1 1 1 I 1 1 1 II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 




Db 


1081 


TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 



1140 


Qy 


1141 


AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 


1200 






1 1 1 1 1 1 1 I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


1141 


AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG


1200


Qy 


1201 


GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 


1260 






f 1 1 1 1 M 1 i 1 1 1 1 1 M 1 1 1 1 II 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 




Db 


1201 


GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC


1260


Qy 


1261 


AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 


1320 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1261 


AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG


1320 


Qy 


1321 


AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 


1380 






II 1 1 III 1 1 II 1 1 1 1 1 1 1 1 1 MM 1 1 III 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 




Db 


1321 


AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT



1380


Qy 


1381 


AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 


1440 






II II III MM MM MMIIIMM MM lllllll MM III 1 III II II MM MM 




Db 


1381 


AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA



1440


Qy 


1441 


TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 


1500 






MMIIIMM IMIMIMMIMMMMMMIMMMMIMMIMIMIMM 




Db 


1441 


TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA



1500


Qy 


1501 


AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA


1560 






II MIMMMII Mill IM II M MM MIMM 1 IMIMI ! 




Db 


1501 


AAGTTC^AAGCTAATGATCACGGATATGAC^CTTCCGTTCCAGTAATAAATACAGCTCA 


1560 


Qy 


1561 


TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 


1620 






II II II II II Mill II; II II Ml II M: II MIMM 




Db 


1561 


TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 


1620 



Qy 



1621 AACAAAATGAAAC^TTTGCCAAAACAAAAaUW^CTATGTATTTGCACAGCACACTAT 1680 



Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 



IIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIMIIMIIIIIIIIIIIIIIIIIIMI 

AACAAAATGAAACATTTGCCAAAACAAAACAAAAAAOT 1680 
TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 174 0 

Ml III Mill Ml II IMMM I IMMIMI IMMIIII II II 

TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 174 0 
TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 

M II II II II III .Mill I MM' II II I II I II 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 
TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860

I II M II I Ml I II M I! M II II M MMI II 

TTTTTACAGTTAGCACTTGAACATAGCTCTTAACAACTTCCAGGATATTC^ 1860 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

M MMI M II I IM I II II II M Mill M Mill II 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 
AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 198 0 

II II; MM! M II !M IM M IM MMI IM Mil! I I 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 
TAT CACACTATTATCAGATTGTAATTAGATG CAAATGAGAGAG CAGTTTAGTTGTTGCAT 2100 

M MMI I IM M II: I II M MM IMIMMMM MMI M 

TATCA(^CTATTATC^GATTGTAATTAGATGOW^TGAGAGAGCAGTTTAGTTGTTGCAT 2100 
TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAG CAAGG CTGT 2160 

IIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIMIMIMIIIIIIIII 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 
TTTTGAAAATCATTACACTTTCACTAGAAGCCO^ACCTCAGCATTCTGCAATATGTAAC 222 0 

1 1 n 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TTTTGAAAATCATTAC^CTTTC^CTAGAAGCCC^W^CCTCAGCATTCTGC^ATATGTAAC 2220 
CAACATGTCACAAACAAGC^^ 228 0 

M M I IM I IM I II M Ml I Ml M MI II II M 

CAACATGTCACAAACAAGCAGCATGTAAC^^ 228 0 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTO^GTTAAGATCAAACCTCA 234 0 

Ml II MMI II Ml M II I II Ml MUM Mill: II I 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 
C^^GAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

MMMM II MM, MMi II, MMM I MM I II II Ml II 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 24 60 

ImMiM M IMMI MM, IMMMII I MMM II I Ml II 

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460 
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 

IMM I M IMMI IMMMII MIMM I MMM II I Ml II 



Db 



24 61 TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 



Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 



CTG CATGTAGATGATTAAATGAGGG CAGG CCCTGTG CTCATAG CTTTACGATGGAGAGAT 2580 

CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 264 0 

IIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIII 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 264 0 

G CAGG TAG CACCCTCTCTCACCCATG CTG TGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

IIIIIIIIMIIMIIIIIMI'IIII llllllllllllll INI INI IIIMI 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 



M li M, M IM I II I II II I M IM M M II I 



TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 282 0 

I Ml IMII II III I II I II II M II MMIMM Ml 

TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 282 0 



I II MMM II III I IM I II M II II III M I II I 



I II IMM II IM I IM I Mill II I II M I II I 



GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 II II 1 1 I II IM 1 1 1 ,1 1 1 M 1 1 1 i M II M II I ,i 1 1 II 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 
GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 

MIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMMIIIIIMIIIIII 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 
CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAA^ 312 0 

M Mill II II IM I IM I II II M M II IMMMIMM 

CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACA 3120 
GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 

IIMIMMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIMIIIIIIII 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 
ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 

I Ml MMM II II MMI IMM II II II Ml I M M I 

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 
TTGTTTTCTGTCAATATTGAATGTGATGGTAC^GTAAACCAAAACCCAACAATGT^ 33 00 

IIIIIIMIMIIIIIMIIMIIMIIIIMMIIIIIIIIIIIIIIIMIIIIMIII 

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCC^CT^TGTGGCC^ 33 00 



IIIIIMIMIMIMIMIIIMIIIIIIMIIIMIIIMIIIMIIIIIIMIIII 



Qy 


3361 


ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 


3420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 I 1 1 1 
M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 I 1 M 1 1 1 




Db 


3361 


AC^AACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 


3420 


Qy 


3421 


CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 


3480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 I 1 1 M 1 1 M M I 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 




Db 


3421 


CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 


3480 


Qy 


3481 


TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 


3540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M 1 M 1 M 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 II 11 1 1 1 1 1 II II 1 II II 1 1 1 1 1 1 1 1 1 1 II I I I II 




Db


3481 


TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 


3540 


Qy 


3541 


TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 


3600 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M 1 II 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I II I I I I | I I I | I | | | | || | 1 | | 




Db 


3541 


TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 


3600 


Qy 


3601 


TGAAACTACACAC^AAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 


3660 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


3601 


TGAAACTAC^CACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 


3660 


Qy 


3661 


TTTAAAAAAAATGTTTGATTC^UyVCTTTAAC^TACTGATAAGTAAGAAAC^TTATAAT 


3720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 M 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


3661 


TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 


3720 


Qy 


3721 


TTCTTTAC^TACTC^W^CC^GATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 


3780 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


3721 


TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 


3780 


Qy 


3781 


TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 


3840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I 




Db 


3781 


TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 


3840


Qy 


3841 


CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 


3900 






1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 II 1 1 1 I 1 I I I I I 1 | | | | | | | | | | | | | | 1 | [ 1 1 1 1 1 1 1 1 1 1 




Db 


3841 


CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 


3900


Qy 


3901 


GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 


3960 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 
1 1 N 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I I I I I I | | | | | | | | | | | || | | | | II | | | | | | | | 1 | | | | | 




Db 


3901 


GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 


3960 


Qy 


3961 


GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 


4020 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 




Db 


3961 


GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT



4020


Qy 


4021 


AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 


4080 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I 




Db 


4021 


AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT


4080


Qy 


4081 


TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 


4140 






llllllllllllllllllllllllllllllllllllllllllliiiiiiiiiiiiiiiii 




Db 


4081 


TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 


4140 


Qy 


4141 


ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 


4200 






1,,11'M MUM MM III ,11 II. II, II II '11,11 IIIU 




Db 


4141 


ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 


4200 



Qy 4201 CAAGTGGACATTATTTATGTTAAATATACAATTATC^ 4260

i I I I I I I I I I I II I I I I II I 'I I I I I i I III II ! I M M I I I I II I I I I I I I I I II I I 

Db 4201 CAAGTGGACATTATTTATGTTAAATATA(^TTATC^GCAAGTATGAAGTTATT(^ATT 4260

Qy 4261 AAAATGCCACATTTCTGGTCTCTGGG 4286

        ||||||||||||||||||||||||||

Db 4261 AAAATGCCACATTTCTGGTCTCTGGG 4286

RESULT 5 
ABZ42661 

ID ABZ42661 standard; DNA; 4286 BP. 
XX 

AC ABZ42661; 
XX 

DT 04-MAR-2003 (first entry) 
XX 

DE Human endothelin B receptor nucleotide SEQ ID NO: 113. 
XX 

KW G protein-coupled receptor; GPCR; antigenic peptide; gene therapy; 

KW G protein-coupled receptor modulator; antibody; immune-related disease; 

KW growth-related disease; cell regeneration-related disease; AIDS; cancer; 

KW immunological-related cell proliferative disease; autoimmune disease;

KW Alzheimer's disease; atherosclerosis; infection; osteoarthritis; allergy; 

KW osteoporosis; cardiomyopathy; inflammation; Crohn's disease; diabetes; 

KW graft versus host disease; Parkinson's disease; multiple sclerosis; pain; 

KW psoriasis; anxiety; depression; schizophrenia; dementia; memory loss; 

KW mental retardation; epilepsy; asthma; tuberculosis; obesity; nausea; 

KW hypertension; hypotension; renal disorder; rheumatoid arthritis; trauma; 

KW ulcer; gene; ds.

XX 

OS Homo sapiens.
XX 

PN WO200261087-A2.
XX 

PD 08-AUG-2 002. 
XX 

PF 19-DEC-2001; 2001WO-US50107 . 
XX 

PR 19-DEC-2000; 2000US-257144P . 
XX 

PA (LIFE-) LIFESPAN BIOSCIENCES INC.
XX 

PI Burmer GC, Roush CL, Brown JP; 
XX 

DR WPI; 2003-046718/04. 

DR P-PSDB; ABP81815. 
XX 

PT New isolated antigenic peptides e.g., for G protein-coupled receptors 

PT (GPCR), useful for diagnosing and designing drugs for treating

PT conditions in which GPCRs are involved, e.g. AIDS, Alzheimer's disease, 

PT cancer or autoimmune diseases 

XX 

PS Disclosure; Fig 1; 523pp; English. 
XX 

CC The present invention describes antigenic peptides (I) comprising: 

CC (a) any one of 1601 sequences (see ABP82019 to ABP83619) of 12-24 amino 



CC acids. Also described: (1) an assay for the detection of a particular 

CC G protein-coupled receptor (GPCR) or a candidate polypeptide in a sample; 

CC and (2) an isolated antibody having high specificity and high affinity 

CC or avidity for a particular GPCR. (I) can be used as GPCR modulators and 

CC in gene therapy. The antigenic peptides for GPCRs are useful in detecting 

CC an antibody against a particular GPCR, and in the production of specific 

CC antibodies. The peptides and antibodies are also useful for detecting the 

CC presence or absence of corresponding GPCRs. The antigenic peptides for 

CC GPCRs and antibodies are useful for diagnosing and designing drugs for 

CC treating immune-related diseases, growth-related diseases, cell

CC regeneration-related disease, immunological-related cell proliferative

CC diseases, or autoimmune diseases, e.g. AIDS, Alzheimer's disease, 

CC atherosclerosis, bacterial, fungal, protozoan or viral infections, 

CC osteoarthritis, osteoporosis, cancer, cardiomyopathy, chronic and acute 

CC inflammation, allergies, Crohn's disease, diabetes, graft versus host 

CC disease, Parkinson's disease, multiple sclerosis, pain, psoriasis, 

CC anxiety, depression, schizophrenia, dementia, mental retardation, memory 

CC loss, epilepsy, asthma, tuberculosis, obesity, nausea, hypertension, 

CC hypotension, renal disorders, rheumatoid arthritis, trauma, ulcers, or 

CC any other disorder in which GPCRs are involved. The antibodies may be 

CC used in immunoassays and immunodiagnosis. ABZ42523 to ABZ42869 encode

CC GPCR proteins given in ABP81675 to ABP82018, which are used in the 

CC exemplification of the present invention. 

XX 

SQ Sequence 4286 BP; 1327 A; 829 C; 816 G; 1314 T; 0 other; 

Query Match 99.6%; Score 4284.4; DB 25; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy 61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy 181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

|||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||

Db 181 AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Qy 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420



Qy 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

Qy 481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

Qy 541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

Qy 601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

Qy 661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

Qy 721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

Qy 781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

Qy 841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

Qy 901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960

Qy 961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020

Qy 1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080

Qy 1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140

Qy 1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200

Qy 1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1 9ft 1 

J. y O 1 


Qy 


2041 


Db 


2041 


Qy 


2101 



AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320 

I MM I II: II I Ml' II II I I! MM I MM III 

AG CAGGATT CTGAAGCTCACTCTTTATAAT CAGAATGATCC CAATAGATGTGAACTTTTG 1320 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 

i i i . i i ; s, i i r 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 
AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 

1 : I . i ! ' 'I I ' 'I II 

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 
TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

II II I II IMM [|"|M|M MMIIMM I IM IMM 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 

MIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIII! 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620 

I II IM II II I II II II I M IMM II III 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620 

AACAAAATGAAACATTTGCCAAAACAAAACAAAAAA 1680 

IMMM MMI II M II II II M M III M IMMM I 

AACAAAATGAAAC^TTTGCCAAAACAAAACAAAAAACT^ 1680 

TAAAATATTAAGTGTAATTATTTTAACACT CACAG CTACATATGACATTTTATGAG CTGT 1740 

IMMM MUM II IIMM IM II II II III I MIIIIMM 

TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 1740 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGC CT CGTCGTGAAAG CACTTAAT 18 00 

I I I ■ II ill IN; : I 1:1 Mil : II 

TTACGG CATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAG CACTTAAT 18 00 

TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTC 18 60 

llllilllllllllllllMiiiiMlMMillllMliiiiiiiiiiiiiiiiiiiii 

TTTTTACAGTTAGCACTTCAACATAGCTCTTAAC^ I860 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920 

II MM I MM II MMI II, || M II II IMIMMM I 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 

IIIIIIIIMIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIMIIMIII 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 

I MM I IM II M II II Ml M II II MMMMM II 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 

TATCACACTATTATCAGATTGTAATTAGATG CAAATGAGAGAG CAGTTTAGTTGTTG CAT 2100 

M MIMM II II I! IMMI M I MM MMI M II II 

TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100 

2101 TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAG CAAGG CTGT 2160 



Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 



1 1 MM 1 1 1 1 1 '! 1 1 II 1 1 1 'II 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 II II I !l 1 1 1 1!! II I 

TTTT CGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAG CAAGG CTGT 2160 
TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 222 0 

MM Ml Mill Ml M Ml I Ml MMM IM MMI II I 

TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTC^GCATTCTGa^TATGTAAC 222 0 
CAACATGTCACAAACAAG CAGCATGTAACAGACTGG CACATGTGCCAG CTGAATTTAAAA 228 0 

I I MM MMM M M II M MM MMI III MMI II I 

CAACATGTCACAAACAAG CAGCATGTAACAGACTGG CACATGTG C CAG CTGAATTTAAAA 228 0 
TATAATACTTTTAAAAAGAAAATTATTACATCCTTTAC^TTCAGTTAAGATCAAACCTCA 234 0 

I M M I Ml Ml I II M MIMIMM IM I Ml II II 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 
CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

II Ml I! i II II I Ml I II ill II II I IM MMI 

CAAAGAGAAATAGAATGTTTGAAAGG CTATCC CAAAAGACTTTTTTGAAT CTGTCATT CA 24 00 
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460 

II II II i II IM I ill I II IM II II I III MMI 

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460 
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2 52 0 

M M II I M III I Ml M II Ml II II M IMMIMI 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 
CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2 580 

MIMIMMIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIMIIIIIIIIIIIIMI 

CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580 
GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGGVGTGTCCACATGACAAAGGG 2 64 0 

II M II I II Ml I Ml I MMM II II I IMMIMI 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 264 0 
GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

M I MMM II M IM IMMIMI II III MMI III Ml II 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 
GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 

II MMM IMMIMI I MMI IMMMII MM MMM MMI 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 
TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 282 0 

M I IM II II MM Ml Mill. II II II M II MMM 

TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2 820 
ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 

IMIIIIIIMIMIIIIIIIMIIMIIIIMIIIIIIIIIIIIIIIIIMIIIIIIII 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 28 8 0 
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 

M I III I! II II II II M MMMI M II II II III II 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 294 0 
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3 000 

IIIIMIIMIIIMIIIIIMIIIIIIIIIIIMIIIIIIIMIMIIIIIIIIMIII 



Db 


2941 


pppatpapatptptptpa a aptatpta pa apap a a a a ppp a an ananann a a a TnannTn 

VJUUri. X VJJ-rtVJJ-rt lUlOlUl 0.rt_T_r_V_J Inlui .rt. V_^AnO_Aoi _ i>_rt_ t '_ V__ VJJ ^Jr\2^\j^\^^\j±\y^\j±\±\t\ X OftuU 1 O 


t n n n 

j VJ u u 


Qy 


3001 


GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 


3060 






1 1 1 i 1 1 1 1 1 E 1 i 1 i 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 E 1 ! 1 1 1 1 1 1 1 1 E 1 1 1 1 i 1 1 1 1 1 1 1 1 E 1 




Db 


3001 


GGGTTGGAGGAAAPPPATPPPPAPAPATTPPPATTPTTAPPPTAAPPTTPPTPATTPPPT 
vjvjvj x x v_j v—JZT-n-Ti \ — \ — v — n. x v_jv_jvjvjj_^v_j^v_j.r_. x x v_,v_,v_-r_ i ili x,n\vjv_,v_, x .rxrtA vjj x x VwVjj x v__r_, x x vjj v_ v_ x 


^ OfiD 
judv 


Qy 


3061 


OTTCAC^TCAATGOy^GGTCCTGATTTTGTTCC^GCAAAACACAGTGCAATGTTCTCA 


3120 






IMIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIMM MMIMMMMMIMM 




Db 


3061 


PGTPAPATPAATGPA AAAPPTPPTPATTTTPTTPPAPPA A A A PA PAPTPPA ATPTTPTPA 

v— vj x v — ^vv__f^. x v — j. \jj v — t^c^r^r\.\j\j x v_ v_ x vjri X X X X vj X X v .y 71, ,.t^\./-\f^\n\ , r*_V_,rtV_7 X OLnn X \j X X v_, X V rt 




Qy 


3121 


GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 


3180 






IMME MM 1 II III! INI III MM lllllllllll lllllll lllllll lllllll 




Db 


3121 


GAGTGAPTTTPGAAATAAATTPPPPPPAAPAPPTTTAAPTPPPTPTTAAAATATPPPPA A 

v_xn.v_j x \_t_-_\_ xxx uunnn x r\r\r\ x x vj vjvj v_ v_ v^rtJ-V\jrt.Vj v_ x X X ,rtrt.V_ x 1L1 1 i^nJr\n X rt. 1 vjj v_.V_. v__rtrt 


"lion 
JlOU 


Qy 


3181 


ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 


3240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


3181 


ATTTTTACTTTGTTTTTPTTTTAATAGGPTGPPPPAPATPTTPPAAATAAPPTAPTAATP 

x x x x x j. j. j. v_j xxxxxv_xxxx ii^x x vjr v_ j. vjivj; vjj v_ v__rt.v__rt. x \j x x ovjn_T__r_ InnUV* X X .rtrt, X vj 


»5 Z *± U 


Qy 


3241 


TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 


3300 






1 1 1 II 1 II 1 1 1 1 1 M 1 II 1 1 II 1 II 1 1 II 1 1 1 1 1 II 1 1 1 II II 1 1 1 1 1 II 1 1 II 1 1 II 1 1 




Db 


3241 


TTGTTTTPTPTPAATATTPA ATPTPATPPT A P APTA A A PP A AAA PPPA A PA ATPTPPPPA 

J- x w x x x 1 >w 1 \J ± \ — lui x -ii. x x \_Lr__r_, x vj x \_xr_, x vjjvjj x _^v___-_,v_J X nnnLLnrtnnLLLJittLfifi 1 u 1 UuLUH 


7 7 n n 


Qy 


3301 


GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 


3360 






1 1 1 1 1 1 II 1 1 1 1 1 M 1 II 1 1 II 1 II 1 1 II 1 1 1 1 1 II 1 1 1 II II 1 II 1 II 1 1 1 II 1 1 II 1 1 




Db 


3301 


PA A A P A A A PA PP A A T A A T A A TT A A TTPA P A P A PP A TA TPP A TTPT A TTT A T A A A TP A PPP 
VJnrt-riOniArtXJnVjjv — rt_r_ Irtrt X_-_rt. X Xr__r_ X X V__rtV__rtV__rtV_. V__rt 1 rtk. 1 VJJVJJ_H. 1 X V_ Xrt. X X 1 /A. XirLrtrt. X V_~rt.V_.V_.v_. 


j jDU 


Qy 


3361 


ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 


3420 






1 1 1 II 1 II 1 1 1 1 1 M 1 1 II 1 II 1 II 1 1 II 1 1 1 II II 1 1 1 II II 1 1 II 1 II 1 1 1 1 1 1 II 1 1 




Db 


3361 


ACAAACTTGTTCTTTAATTTPATPPPAATPAPTTTTTPAPAPPPPTPTTATPATAPAAPT 

•"%___-__-__-_ v., - 1 - xvjx x v_ x X A_-__-_ xxx v — n. x V.\-Unnl V — rtV_ X X X X X V_-rtO-rtV_JV_JV_-V_, X VJ X In X V__rt X .rt.VjJ.rtrt. VJJ X 


_5 Z U 


Qy 


3421 


CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 


3480 






1 1 1 1 1 1 II 1 1 M 1 II II II 1 II 1 II 1 1 II 1 1 M 1 1 II M II II 1 1 II M 1 1 1 II 1 1 II 1 1 




Db 


3421 


PATTTTAGACTCTPAA TTTTAAATTA ATTTTP A ATPA PTA ATA TTTTP A P APTTT A TT A A 

v—zt. x x x x nuni^ x v_ x v — r^r^. x x x x .r^_Ti_T_ x X ilri x X X X VjJ.rt_rt. x V__rt. V x .rt_£A X .cl X X X X V__rA.V__rt.vjr XXX .rt X X _t_t_ 




Qy 


3481 


TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 


3540 






1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 II 1 II 1 II 1 1 II 1 1 M 1 II 1 M II II 1 II M II II II 1 1 II 1 1 




Db 


3481 


TATATTTAATTTPTATTTA A A TTTTAP A TT A TTTTT ATT A PP A TPT A PTP A ATTTTTAPZ_ 

x_~_ x_-_ x x inn x x x v_ x r\ x X Xrt_rtrt. x X X X rt.Vji.rt. X In X XXX X rt. X X riLLri X Vjj X _riv_- 1 vjrrtrt. XXXI IriLn. 


*5 a n 
j j _u 


Qy 


3541 


TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 


3600 






1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 1 M II II 1 1 II 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 II 1 1 II 1 1 




Db 


3541 


TCCTGATACCCTTTPPTTPTPPATGTPAPTATPATPTTPTPTAATTATPTTPPPAAATTT 

x \_ v_. x v— '-ii x -ii-v — v — v — x x x v_v_ x xv_.x v — v — X vj; x v__riA_J x .£*_ X V__rt. X VJ X X V_ X V_ X t\t\ X X .rt. X V_, X X ULLftnn XXX 


7 & n n 


Qy 


3601 


TG&AACTACACACAAAAAGCATACTTGCATTATTT^ 


3660 






1 1 1 M 1 II 1 1 II 1 II 1 II II II 1 II 1 1 I II 1 1 1 1 II 1 1 1 M 1 II 1 1 1 1 1 1 1 1 II 1 II 1 1 




Db 


3601 


TPAAAPTAPAPAPAAAA APPATAPTTPPATTATTTATA ATA A A A TTP PA TTP A PTPPPTT 

iunnnv*inv^v^VJinnnnuv-nlfl_.l X V_JV rt. X X rt. X X lnli_nlnnnHl X vjj L_rt X X V__rt.VjJ X VjjvjiV_ 1 X 


jodU 


Qy 


3661 


TTTAAAAAAAATGTTTGATT(ZAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 


3720 






1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i j 




Db 


3661 


TTTAAAAAAAATGTTTPATTPAAAAPTTTA APATA PTP ATA A PTA APA A A PA ATT AT A AT 

x x ir_r_r_r_f_f_f_fiivji X X \_7.rt X 1 ___nnftL.l X inn _nln_ 1 Un lnnvj InnunnnLnn 1 lnlnnl 


_5 / __ U 


Qy 


3721 


TTCTTTACATACTCAAAACCAAGATAGAAAA^ 


77ft 0 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 It 1 1 1 1 IE 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


3721 


TTCTTTACATACTCAAAACCAAGATAGAAAAAG 


3780 


Qy 


3781 


TTCCTAGTATTAAGGACTTTAATATAGCAACAGAC-AAAATTATTGTTAACATGGATGT^ 


3840 






1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 




Db 


3781 


TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 


3840 



Qy 3841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900 

; . i : ; I . I i ' ! I i 

Db 3 841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900 

Qy 3 901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 

IMIIIIIIIIIMMIMIMMIIIMIIIIIIIIIIIIMIIIIIIIIIIIIMIII 

Db 3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 

Qy 3961 GTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 3961 GTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 

Qy 4 021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 08 0 

I I MM II IMM II IMMMII II M IMIIIMM Mill 

Db 4 021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 08 0 

Qy 4 081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 

I I II II IMI II II Ml ||! || M |||,|!:||| IIMI 

Db 4 081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 

Qy 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 

I 1 1 1 1 ' II 1 1 ! 1 1 M ! ! I 1 1 

Db 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 

Qy 42 01 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTC^TT 4260 

M Mill 1 II M IM I Ml II II M M II II II Ml I 

Db 4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAA.GTATGAAGTTA 4260 

Qy 42 61 AAAATGCCACATTTCTGGTCTCTGGG 4286 

I MUM II M IM I I 

Db 42 61 AAAATGCCACATTTCTGGTCTCTGGG 4286 

RESULT 6 
AAF21288 

ID AAF21288 standard; DNA; 13611 BP. 
XX 

AC AAF21288; 
XX 

DT 14-MAR-2001 (first entry) 
XX 

DE Human low adenosine antisense oligonucleotide related sequence #2855. 
XX 
KW Low adenosine antisense oligonucleotide; phosphorothioate; allergy;
KW human; airway disorder; bronchoconstriction; lung inflammation;
KW surfactant depletion; respiratory; bronchodilator; antiinflammatory;
KW immunosuppressive; antiasthmatic; analgesic; hypotensive; cytostatic;
KW respiratory obstruction; pulmonary obstruction; impeded respiration;
KW surfactant hypoproduction; pulmonary vasoconstriction; asthma; RDS;
KW respiratory distress syndrome; pain; cystic fibrosis; allergic rhinitis;
KW pulmonary hypertension; emphysema; pulmonary transplantation rejection;
KW chronic obstructive pulmonary disease; pulmonary infection; bronchitis;
KW cancer; ss.
XX 

OS Homo sapiens. 
XX 



PN WO200062736-A2. 
XX 

PD 26-OCT-2000. 
XX 

PF 24-MAR-2000; 2000WO-US08020.
XX

PR 06-APR-1999; 99US-0127958.
XX 

PA (UYEC-) UNIV EAST CAROLINA. 

PA (NYCE/) NYCE J W. 

XX 

PI Nyce JW; 
XX 

DR WPI; 2000-679539/66. 
XX 

PT Low adenosine (A) content antisense oligonucleotides which do not 

PT trigger adenosine receptors during metabolism, useful e.g. for treating 

PT cancers and respiratory obstructions - 

XX 

PS Disclosure; Page 1277-1280; 1592pp; English.
XX 

CC The present invention describes low adenosine (A) content antisense 

CC oligonucleotides and compositions (I) comprising them. In the antisense 

CC oligonucleotides the A is replaced by a 'Universal' or alternative base. 

CC (I) can have respiratory, bronchodilator, antiinflammatory, analgesic,

CC immunosuppressive, antiasthmatic, hypotensive and cytostatic activities. 

CC The antisense oligonucleotides and (I) can be used to down-regulate the 

CC expression and/or activity of target polypeptides associated with

CC lung/respiratory disorders and malignancies, such as stimulating and 

CC activating peptide factors and transmitters, transcription factors, 

CC immunoglobulins and antibodies, antibody receptors, cytokines and 

CC chemokines, endogenously produced specific and non-specific enzymes, 

CC binding proteins, adhesion molecules and their receptors, cytokine and 

CC chemokine receptors, adenosine receptors, bradykinin receptors, central 

CC nervous system (CNS) and peripheral nervous and non-nervous system

CC receptors, CNS and peripheral nervous and non-nervous system peptide 

CC transmitters, defensins, growth factors, vasoactive peptides and 

CC receptors, binding proteins and malignancy associated proteins. The 

CC antisense oligonucleotides may be used in this way to treat disorders 

CC including respiratory obstruction (especially pulmonary obstruction 

CC and/or bronchoconstriction) and/or lung inflammation, allergy(ies)

CC and/or surfactant hypoproduction which are associated with a disease or 

CC condition selected from pulmonary vasoconstriction, inflammation, 

CC allergies, asthma, impeded respiration, respiratory distress syndrome 

CC (RDS), pain, cystic fibrosis (CF), allergic rhinitis (AR), pulmonary

CC hypertension, emphysema, chronic obstructive pulmonary disease (COPD),

CC pulmonary transplantation rejection, pulmonary infections, bronchitis, 

CC and/or cancer. AAF18434 to AAF21543 represent human polynucleotide 

CC fragments and antisense oligonucleotides used in the exemplification of 

CC the present invention. 

XX 

SQ Sequence 13611 BP; 3676 A; 3007 C; 3056 G; 3868 T; 4 other; 

Query Match 99.6%; Score 4284.4; DB 21; Length 13611; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy 1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1873 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 1932

Qy 61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1933 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 1992

Qy 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1993 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 2052

Qy 181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

|||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||

Db 2053 AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 2112

Qy 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2113 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 2172

Qy 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2173 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 2232

Qy 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2233 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 2292

Qy 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2293 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 2352

Qy 481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2353 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 2412

Qy 541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2413 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 2472

Qy 601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2473 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 2532

Qy 661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2533 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 2592

Qy 721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2593 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 2652

Qy 781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 2653 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 2712



Qy 



841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900 



Db 


2713 


Qy 


901 


Db 


2773 


Qy 


961 


Db 


2833 


Qy 


1021 


Db 


2893 


Qy 


1081 


Db 


2953 


Qy 


1141 


Db 


3013 


Qy 


1201 


Db 


3073 


Qy 


1261 


Db 


3133 


Qy 


1321 


Db 


3193 


Qy 


1381 


Db 


3253 


Qy 


1441 


Db 


3313 


Qy 


1501 


Db 


3373 


Qy 


1561 


Db 


3433 


Qy 


1621 


Db 


3493 


Qy 


1681 



I I IMMI IMMI I Ml IMIIMM I MMM II Ml I 

GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 2772 
ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 

I I III III IM Mill MMM IM II Ml II I MM II 

ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 2832 
ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 102 0 

I I ill II II Mill MMMMIMM III M I MM II I 

ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 2892 
AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080 

M II IM II IMM I II II II II I MIMI || MMM 

AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 2 952 
TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 

IIIIIIIIIIMIIIIIIIIMIIIIIilllllMIIMMIIIIIIIIIIIIIIIIIII 

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 3 012 
AGAAAGAAAAGTGG CATGCAGATTGCTTTAAATGATCACCTAAAG CAGAGACGGGAAGTG 1200 

IMMI III II I Ml M II M II III II MMM MMMMl 

AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 3072 
GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 

IIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 3132 
AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 132 0 

IIIIIIIIIIIIIMIIIIMMIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 

AGCAGGATT CTGAAG CTCACT CTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 3192 
AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 

IIIMIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIMIIIIMIIIIIMIMIM 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 3252 
AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 

IM II M II I II M II MIMM IMMI II IIIIIMM I 

AAC CCAATTG CTCTGTATTTGGTGAG CAAAAGATTCAAAAACTG CTTTAAGTCATG CTTA 3312 
TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

M IMM ill II MM IMMI II Mill I III III II II! I 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 3372 
AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 

M Ml M MMM II M II Ml M IMMMM II MMI M 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 3432 
TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 162 0 

M II M II I II Ml II IM M II I II IMM MM I 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 34 92 
AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTG^ 168 0 

M MM II IMM, IMMI I Ml M M. I Ml IMM III I 

AACAAAATGAAAC^TTTGCCAAAACAAAACAAA 3552 
TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 

I MIMM IMMI IMMI I Ml I || (MM MM I MM I 



Db 


3553 


Qy 


1741 


Db 


3613 


Qy 


1801 


Db 


3673 


Qy 


1861 


Db 


3733 


Qy 


1921 


Db 


3793 


Qy 


1981 


Db 


3853 


Qy 


2041 


Db 


3913 


Qy 


2101 


Db 


3973 


Qy 


2161 


Db 


4033 


Qy 


2221 


Db 


4093 


Qy 


2281 


Db 


4153 


Qy 


2341 


Db 


4213 


Qy 


2401 


Db 


4273 


Qy 


2461 


Db 


4333 


Qy 


2521 


Db 


4393 



3553 TAAAATATTAAGTGTAA.TTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 3612 
TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 

llllllll IMIIMIM llllllllllllll IIMIMIIIIIIIIMMIIIIM 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 3672 
TTTTTACAGTTAGCACTTCAAC^TAGCTCTTAACAACTTCC^GGATATTCACACAACACT 1860 

Ml III Mill II: I III Ml IM I MM II M IM M I 

TTTTTACAGTTAGCACTTC^CATAGCTCTTAAC^CTTCCAGGATATTCAC^C^ACACT 3732 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 3792 
AATCAATGGGACT CTGATATAAAGGAAGAATAAGT CACTGTAAAACAGAACTTTTAAATG 1980 

I IM M IMMI IMM IM II II III II II Ml Ml III 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 3852 
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 204 0 

I Ml M II II M II II Ml M IMMI M I! IM II 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 3912 
TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100 

II III I MMMI I Ml III M I ill III I MMMI II 

TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 3 972 
TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIMIIIIIIIIIIIIII 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAG CAAGG CTGT 4 032 
TTTTGAAAATCATTACACTTTCACTAGAAGCCC^^CCTCAGCATTCTGCAATATGTAAC 2220 

I II M IMMI Mill II II M II IMM MMMIMM 

TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 4 092 
CAACATGTCACAAACAAGCAGCATGTAACAGACT 2280 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIMIIIIIII 

CAACATGTCACAAACAAG CAG CATGTAACAGACTGG CACATGTGCCAG CTGAATTTAAAA 4152 
TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 

IIIIIIIIMMIIIIIIIIIIIIMIMIIIIIMIIIIIMIIIIIIIIIIIIIIMI 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 4212 
CAAAGAGAAATAGAATGTTTGAAAGG CTAT CC CAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

I II IM II MMMI MMM M M IMMI II II II II 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 4272 
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460 

MMIMMMIMMI Ml M IM III Mill MM IMM MMMI 

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 4332 
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 

l,MI Ml II Ml II I II II IM I II, IM IMIMMI 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 4392 
CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580 

IIMIIIIIIIMIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIMI 

CTG CATGTAGATGATTAAATGAGGG CAGG CCCTGTGCTCATAG CTTTACGATGGAGAGAT 4452 



Qy 


2581 


Db 


4453 


Qy 


2641 


Db 


4513 


Qy 


2701 


Db 


4573 


Qy 


2761 


Db 


4633 


Qy 


2821 


Db 


4693 


Qy 


2881 


Db 


4753 


Qy 


2941 


Db 


4813 


Qy 


3001 


Db 


4873 


Qy 


3061 


Db 


4933 


Qy 


3121 


Db 


4993 


Qy 


3181 


Db 


5053 


Qy 


3241 


Db 


5113 


Qy 


3301 


Db 


5173 


Qy 


3361 


Db 


5233 



GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 264 0 

I I I II II I I I I I I I III I I I I I II I M I! I I I I III II I I I I IM I I I I I I I I I I 
GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 4 512 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 27 00 

i i ii iiiMiiiiii ii ii h mi muni in ir inn 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 4 572 
GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 

MM III 1 1 1 1 1 1 1 1 1 II I II II I II MM 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml II II 1 1 II 1 1 1 1 1 

G CTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAG CTACCTG 4 632 
TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAG CCAATAGAAAAGTTTG CTTG 282 0 

Ml I II II II II I Mill II Ml II II I III III II 

TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTG CTTG 4 692 
ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 

M I IM II II II I IM I IM I! II Ml I Ml I I 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 4 752 
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 M 1 1 II M 1 1 1 1 M 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 4812 
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 f 1 1 II 1 1 1 1 1 1 1 1 1 f 1 1 Mill 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 4872 
GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 

IIIIIIIMIMIIIIIIIIIIIIIMIIIMIIIIIIIIIMIIIIIIIIIMIIIIII 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 4 932 
CGTC^CATC^TGC^^GGTCCTGATTTTGTTCCAGCAAAACACAGTGC^TGTTCTCA 312 0 

II MMM I IMMM M Ml M^ II Ml II II M II M II 

CGTCAC^TC^TGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 4 992 
GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 

IMM II I MMMM MMI M IM II II II I Ml IM II 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 5 052 
ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 

1 1 1 1 1 1 ^ 1 1 ' II I [ill II 1 1 1 1 1 1 II l [ il ill ll ! 1 1 1 

ATTTTTACTTTGTTTTTCTTTTAATAGG CTGGG CCACATGTTGGAAATAAG CTAGTAATG 5112 
TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 33 00 

I Mill MIMI Ml I II M:MI III II II IM II II II 

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACC^AAACCCAACAATGTC 5172 
GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 

I I III II II IMM II I M IMIIMII IMIM I III 

GAAAGAAAGAG CAATAATAATTAATT CACACAC CATATGGATTCTATTTATAAAT CACCC 5232 
ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 342 0 

IMM II IMIMMI MMI M III II il II I Ml IM II 

ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 5292 



Qy 


3421 


CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 


3480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 M 1 1 1 i 1 1 1 I 1 M 1 1 1 1 i I! 1 Ml' 




Db 


5293 


CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA


5352 


Qy 


3481 


TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 


3540 






III lllill 1 II II MM Ml MM MM MM Ml MM Mil III MM IIIMI 1 




Db 


5353 


TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA


5412


Qy 


3541 


TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 


3600 






III III MM MM lllllll MM MM III 1 MIMIMIM Ml MM MM III 




Db 


5413 


TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT


5472


Qy 


3601 


TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 


3660 






MM III MM MM lllllll MM MM III 1 llllllllllllll llllllll III 1 




Db 


5473 


TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT


5532


Qy 


3661 


TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT


3720 






MIIIMI III MM IIIMIIIIIIMMIII III II II II III MIMMIMIMM 




Db 


5533 


TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT


5592


Qy 


3721 


TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT


3780 






1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 ] 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 E 1 




Db 


5593 


TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT


5652


Qy 


3781 


TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 


3840 






1 1 1 1 1 1 1 1 1 1 1 It 1 1 1 1 1 1 r M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 M 1 1 1 1 1 1 




Db 


5653 


TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA


5712


Qy 


3841 


CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 


3900 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M ! 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 




Db 


5713 


CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT


5772 


Qy 


3901 


GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 


3960 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 !! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


5773 


GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA


5832


Qy 


3961 


GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT


4020 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1! 1 




Db 


5833 


GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT


5892


Qy 


4021 


AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 


4080 






1 II 1 1 1 1 1 1 ! 1 1 1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


5893 


AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT


5952 


Qy 


4081 


TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 


4140 






1 M 1 1 1 II II 1 1 1 1 1 M Ml 1 II II II III 1 M 1 II II II 1 II M II II II II III M 




Db 


5953 


TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC


6012


Qy 


4141 


ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA


4200 


Db 


6013 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA


6072


Qy 


4201 


CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT


4260 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 r 




Db 


6073 


CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT


6132 


Qy 


4261 


AAAATGCCACATTTCTGGTCTCTGGG 4286 





IMIIIIIIIIIIIIIIIIIIIIIII 

Db 6133 AAAATGCCACATTTCTGGTCTCTGGG 6158 



RESULT 7 
AAA35166 

ID AAA35166 standard; DNA; 13612 BP. 
XX 

AC AAA35166; 
XX 

DT 28-JUL-2000 (first entry) 
XX 

DE Human adenosine receptor related polynucleotide 2nd SEQ ID NO: 40. 
XX 

KW Human; adenosine receptor; low adenosine antisense oligonucleotide; 

KW phosphorothioate; impaired respiration; inflammation; allergy; 

KW allergic disease; bronchoconstriction; inhibitor; antiinflammatory; 

KW antiallergic; antiasthmatic; cytostatic; analgesic; impaired airway; 

KW lung disease; ischaemic condition; pulmonary vasoconstriction; asthma; 

KW respiratory distress syndrome; pain; cystic fibrosis; emphysema; 

KW pulmonary hypertension; chronic obstructive pulmonary disease; COPD; 

KW cancer; leukaemia; lymphoma; carcinoma; metastasis; ss. 

XX 

OS Homo sapiens.
XX 

PN WO200009525-A2.
XX 

PD 24-FEB-2000. 
XX 

PF 03-AUG-1999; 99WO-US17712.
XX 

PR 03-AUG-1998; 98US-0095212.
XX 

PA (UYEC-) UNIV EAST CAROLINA. 
XX 

PI Nyce JW; 
XX 

DR WPI; 2000-205971/18. 
XX 

PT New antisense oligonucleotides useful for treating e.g. pulmonary 

PT vasoconstriction, inflammation, allergies, asthma, hypertension, 

PT bronchitis, emphysema, respiratory distress syndrome, ischemia or 

PT cancers 
XX 

PS Disclosure; Page 1194-1197; 1343pp; English.
XX 

CC The present invention describes a new composition comprising an 

CC antisense oligonucleotide (ON) with low adenosine (up to 15%), which

CC targets nucleic acids involved in bronchoconstriction, allergies, and/or 

CC inflammation. The ON can have antiinflammatory, antiallergic, 

CC antiasthmatic, cytostatic and analgesic activities. The compositions are 

CC useful for the treatment of diseases associated with inflammation, 

CC impaired airways, including lung disease and diseases whose secondary 

CC effects afflict the lungs of a subject. They can be used for treating 

CC e.g. ischaemic conditions, pulmonary vasoconstriction, allergies, 

CC asthma, impeded respiration, respiratory distress syndrome, pain, cystic 

CC fibrosis, pulmonary hypertension, emphysema, chronic obstructive 



CC pulmonary disease (COPD), and cancers such as leukaemias, lymphomas,

CC carcinomas, and cancers which may metastasise to the lungs, including 

CC breast and prostate cancer. The reduction of the adenosine content of 

CC the ONs reduces side effects. The A-containing ONs break down with the 

CC release of deoxyadenosine which activates adenosine receptors causing 

CC bronchoconstriction and inflammation. AAA32313 to AAA35312 represent the 

CC nucleotide sequences given in the sequence listing from the present 

CC invention, which correspond to SEQ ID NO:1 to 2815, and then the last

CC 185 sequences are also called SEQ ID NO:1 to 185, but the sequences

CC differ from the previously named sequences. SEQ ID NO: 11 to 1680 

CC (AAA32323 to AAA33992) are specifically claimed ONs from the present 

CC invention. N.B. Sequences given in the disclosure of the present 

CC invention do not match up with their corresponding SEQ ID NO: sequences 

CC given in the sequence listing. 

XX 

SQ Sequence 13612 BP; 3677 A; 3007 C; 3056 G; 3868 T; 4 other; 

Query Match 99.6%; Score 4284.4; DB 21; Length 13612; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0;
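
The summary fields above can be read mechanically. The following is a minimal Python sketch, not part of the search program's output; the function name `parse_summary` is hypothetical, and the assumption that Best Local Similarity equals matches divided by aligned columns is an inference from the figures shown here rather than documented behaviour of the tool.

```python
def parse_summary(lines):
    """Collect 'Key value;' pairs from the score summary lines of one result."""
    fields = {}
    for line in lines:
        for part in line.split(";"):
            part = part.strip()
            if not part:
                continue
            # The value is the last whitespace-separated token; keys may contain spaces.
            key, value = part.rsplit(None, 1)
            fields[key] = float(value.rstrip("%"))
    return fields


summary = parse_summary([
    "Query Match 99.6%; Score 4284.4; DB 21; Length 13612;",
    "Best Local Similarity 100.0%; Pred. No. 0;",
    "Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0;",
])

# Assumption: aligned columns = matches + conservative + mismatches (no indels here).
aligned = summary["Matches"] + summary["Conservative"] + summary["Mismatches"]
identity = 100.0 * summary["Matches"] / aligned
print(int(aligned), round(identity, 1))
```

Under that assumption the aligned length comes out at 4286, which agrees with the last query coordinate of the alignment, and the identity rounds to the reported 100.0%.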

GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

1 1 m 1 1 ii i ; 1 1 i i 1 1 :i 1 1 1 1 1 1 1 1 1 iii 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 m ii 1 1 1 1 1 1 



I : I ■ i : i i I i : : I ' : I i I I I I ' I ■ ! ■ I 

AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 

AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 

Ii II I I II II INN I II IN I El II I II MM 

AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 
AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 

IMIIIIIIIIIIIIIIIII IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMI 

AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 
CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 

MlMMMII MM M MIMMIMMMIM II I 

CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 
TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 

Ml IMMI MM MIMM MM MMMIIMI III II I 

TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 
ACCGCAGAGATAATGACGCCA.CCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 

IM MM II MM M I II II I Ml II I II IMM 

ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCO^GGGTTCCAACGCCAGT 
CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 

I II M MIMI M IMM MM! MM II II M III II I 

CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 
CC^CGC^CC^TCTCCCCTCCCCCGTGCC^GGACCC^TCGAGATC^GGAGACTTTC^^ 

IMM IIIMIMI Ml IMM ll MM MIMI IMIMIII 



Qy 


1 


Db 


1873 


Qy 


61 


Db 


1933 


Qy 


121 


Db 


1993 


Qy 


181 


Db 


2053 


Qy 


241 


Db 


2113 


Qy 


301 


Db 


2173 


Qy 


361 


Db 


2233 


Qy 


421 


Db 


2293 


Qy 


481 


Db 


2353 



Qy 


541 


Db 


2413 


Qy 


601 


Db 


2473 


Qy 


661 


Db 


2533 


Qy 


721 


Db 


2593 


Qy 


781 


Db 


2653 


Qy 


841 


Db 


2713 


Qy 


901 


Db 


2773 


Qy 


961 


Db 


2833 


Qy 


1021 


Db 


2893 


Qy 


1081 


Db 


2953 


Qy 


1141 


Db 


3013 


Qy 


1201 


Db 


3073 


Qy 


1261 


Db 


3133 


Qy 


1321 


Db 


3193 


Qy 


1381 



TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 

IIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 2472 
CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIMIIIMIIIIIIIII 

CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 2532 
AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720 

IIIIIIIIIIIMIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMMIIMI 

AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 2592 
CTGCTGGCAGAGGACTGG CCATTTGGAG CTGAGATGTGTAAG CTGGTG CCTTTCATACAG 780 

IIIIIMIIIIIIIMIIIIIIMIIIIIIIMIIIIIIIIIIIMMIIIIIIIIMII 

CTGCTGGCAGAGGACTGG CCATTTGGAG CTGAGATGTGTAAGCTGGTG CCTTTCATACAG 2652 
AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 84 0 

IIIIIIIIIIIMIIIIMIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIMMIIMI 

AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 2712 
GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900 

IIIIIIIIIMIIIIIIIIIIIIIMIIIMIIIIIIIIMIMIIIIIIIIIIIIIIII 

GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 2772 
ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 

MIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMMIIIII 

ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 2832 
ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 102 0 

IMIIIIIMIIMIMIIIMIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIM 

ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 2892 
AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGT 1080 

IIIMIIIIMIIIIIIIIIIIIIMIMIIIIIIIMIIIIIIIIIIIIIIIIMIMI 

AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 2 952 
TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 

II M II Ml M II I IM II II MIMIMII M MIMM II 

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 3012 
AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200 

I Mill II. M II MMI II I! II M 1 II I MMMI II 

AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 3 072 
GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 

II II II III M II I II II MMMMIMM II III II M 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 3132 
AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320 

M II IM IM I IMM Ml II M II I MM I Ml MIMI 

AGCAGGATTCTGAAG CT CACTCTTTATAATCAGAATGATC CCAATAGATGTGAACTTTTG 3192 
AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 

I MIMI III M II MMI II MMIMM II Ml III II I 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 3252 
1381 AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCA?WVACTGCTTTAAGTCATGCTTA 144 0 



Db 


3253 


Qy 


1441 


Db 


3313 


Qy 


1501 


Db 


3373 


Qy 


1561 


Db 


3433 


Qy 


1621 


Db 


3493 


Qy 


1681 


Db 


3553 


Qy 


1741 


Db 


3613 


Qy 


1801 


Db 


3673 


Qy 


1861 


Db 


3733 


Qy 


1921 


Db 


3793 


Qy 


1981 


Db 


3853 


Qy 


2041 


Db 


3913 


Qy 


2101 


Db 


3973 


Qy 


2161 


Db 


4033 


Qy 


2221 



IIIIIIMIMIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIMIII 

AACCCAATTGCTCTGTATTTGGTGAGC^AAAGATTCAAAAACTGCTTTAAGTCATGCTTA 3312 
TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 ! 1 1 II I! 1 1 1 II I ; 1 1 II 1 1 1! II 1 1 . II 1 1 1 1 1 1 1 1 i I I 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 3372 
AAGTTCAAAGCTAATGATCACGGATATGAa\ACTTCCGTTCCAGTAATAAATACAGCTCA 1560 

IIIIIMIMIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIMMIII 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 3432 
TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620 

II ! Ml Ml II M II Ml i Ml II IM I II IMM 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 34 92 
AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATC 1680 

II MUM III II IMM II I Ml II III I II II M 

AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCA 3552 
TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 174 0 

II MM Ml M MMI Ml I Ml II IM I Ml MMI 

TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 3 612 
TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 18 00 

IIIIIIIIMIIIIMIIhllllllllllll IIIIIIIIIMI IIIIMIIIIII I 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 3672 
TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACT^ 1860 

II Mill IM M: MMI !M I III II M' I Ml MMI 

TTTTTACAGTTAGCACTTCAACATAGCTCTTAACA^ 3732 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

i i 1 1 1 M E 1 1 1 1 E 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 S ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 3792 
AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 198 0 

II I Ml III Ml I! II III I Ml Ml II Mill MM 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 3852 
AAGCTTAAATTACTC^TTTAAAATTTTAAAATCCTTTAAAAC^CTTTTC^TTAATAT 2 04 0 

II M IM II II M Ml II I Ml II IM I Ml II II 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 3 912 
TATCACACTATTATCAGATTGTAATTAGATG CAAATGAGAGAG CAGTTTAGTTGTTG CAT 2100 

M 1 I II II Ml M II MMM IM II Ml M II Ml M 

TATCACACTATTATCAGATTGTAATTAGATG CAAATGAGAGAG CAGTTTAGTTGTTG CAT 3972 
TTTTCGGACACTGGAAAC^TTTAAATGATCAGGAGGGAGTAAC^GAAAGAGCAAGGCTGT 2160 

IIIIMMIIIMMIIIMIIIMIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIMI 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAG CAAGG CTGT 4 032 
TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCT 2220 

1 1 II II 1 1 III Mil MM III II II I INI I IMM IIIIIIIIIMI 

TTTTGAAAATCATTAC^CTTTCACTAGAAGCCCAAACCTC^GCATTCTGC^ATATGTAAC 4 092 
CAACATGTCACAAACAAGCAGCATGTAACAGACTGGCA^ 228 0 

IM M ill II Ml M MM IM II II II IM I Ml MMI 



Db 


4093 


Qy 


2281 


Db 


4153 


Qy 


2341 


Db 


4213 


Qy 


2401 


Db 


4273 


Qy 


2461 


Db 


4333 


Qy 


2521 


Db 


4393 


Qy 


2581 


Db 


4453 


Qy 


2641 


Db 


4513 


Qy 


2701 


Db 


4573 


Qy 


2761 


Db 


4633 


Qy 


2821 


Db 


4693 


Qy 


2881 


Db 


4753 


Qy 


2941 


Db 


4813 


Qy


3001 


Db 


4873 


Qy 


3061 


Db 


4933 



4 093 CAACATGTCACAAACAAGCAGCATGTAACAGA 4152 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTA(ZATTCAGTTAAGATCAAACCT^ 234 0 

I I I I I I I I I I I i I I II I I II I I I I II I II II I II II I I I I II II II I I I I I I I I I I I 
TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 4212 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

I MIL II II M J Mil II II Ml I ill M II ,11 I 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 4272 
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460 

i 1 ii iii ii it ii i ii iinn ii i mm i ii iihii 

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 4332 
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 

1 1 f 1 1 1 1 ! 1 1 1 1 1 1 1 E 1 1 i 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 4392 
CTG CATGTAGATGATTAAATGAGGG CAGG CCCTGTGCT CATAGCTTTACGATGGAGAGAT 2580 

I I I I ■ I I i ■ ! : I - i : ' I III : I i : I 

CTG CATGTAGATGATTAAATGAGGG CAGG C C CTGTG CT CATAG CTTTACGATGGAGAGAT 4452 
GCC^GTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGAC^AAGGG 264 0 

IIIIMIIIIIIIIIIIIIIMIIIMIMMIIIIIIIIIIIIIIIIIIIIIIIIMI! 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 4512 
GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

II II IMMI II II I II MMIM IIMM III II II Mill 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 4572 
GCTATAGTTAAAATACTATTTTTOW^ATCATACAGATTAGTAC^TTTAACAGCTACCTG 2760 

IIIIIIIMIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 



TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAG C CAATAGAAAAGTTTGCTTG 282 0 

I I III MMIM II I II IIMM II Ml III II II IIMM 

TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAG C CAATAGAAAAGTTTGCTTG 4 692 
AOVTGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 288 0 

IIIMMIIIIMIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIMMIIIII 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 4752 
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 

IIMIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIII 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 4812 
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 4872 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3 060 

I 1 -I I I 1 , ;! I !,ll I h, ' ■! 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 4 932 

CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCC^ 312 0 

IIIMIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAG 4 992 



Qy 


3121 


Db 


4993 


Qy 


3181 


Db 


5053 


Qy 


3241 


Db 


5113 


Qy 


3301 


Db 


5173 


Qy 


3361 


Db 


5233 


Qy 


3421 


Db 


5293 


Qy 


3481 


Db 


5353 


Qy 


3541 


Db 


5413 


Qy 


3601 


Db 


5473 


Qy 


3661 


Db 


5533 


Qy 


3721 


Db 


5593 


Qy 


3781 


Db 


5653 


Qy 


3841 


Db 


5713 


Qy 


3901 


Db 


5773 



GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 

IMIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIII 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 5052 
ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240 

IMIMIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIMMIIIIIII 

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 5112 
TTGTTTTCTGTC^TATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 5172 



M ii 'I it II mil II I II ;ii 'II I ill Ml I 



ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420 

llllllllll MM IIIIIIIIMIIIIIIIIIIIMIIIIIII lllllllllllllll 

ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 52 92 
CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480 

II II M M M II Ml Mi l MIMI II I III III I 

CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 5352 
TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 

II II II II Mill III II M II II IM I MMMi II 

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 5412 
TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 

MM:MM II Ml IM II II M II IM II M: IM IM 

TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 5472 
TGAAACTACACACAAAAAG CATACTTG CATTATTTATAATAAAATTGCATTCAGTGG CTT 3660 

II MIMI II MUM MIMI MMI II III I MMMI II 

TGAAACTACACACAAAAAG CATACTTG CATTATTTATAATAAAATTGCATTCAGTGG CTT 5532 
TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720 

II Ml M II MUM IM II I Ml II IM I MM IMM 

TTTAAAAAAAATGTTTGATTCAAAACTTTAACA.TACTGATAAGTAAGAAACAATTATAAT 55 92 
TTCTTTAC71TACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACT 378 0 

MIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIMIMIIIIIMIIIIIIIIIIIIII 

TTCTTTACATACTC^W^CCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 5652 
TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 384 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIMMIIII 

TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 5712 
CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 

IIIMIIMIIIIII MIMIMIMMMMMIMMMMIMMIIIMIIIMI 

CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 5772 
GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIMIIIIIII 

GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 5832 



Qy 3961 GTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4020 

I : i II: I I i ; ; : i - 1 1 I ; I 1 1 1 1 ; i ! i . 1 1 

Db 5833 GTTTATAG CAAAACATGGGTATG CTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 58 92 

Qy 4 021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 080 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 5893 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 5952 

Qy 4 081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 

IlilllMIIIMIIMIIIIIIIIIIIIIIIIIIIIMMIIIMIIIIIIIIIIIIM 

Db 5953 TTTATTATGTAAGC^AAACO^ATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 6012 

Qy 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 

IIIMIIIIIIIIIMIIIIIMIIMMIMMIIIIIIIIIIMIIIIIIMIIIIII 

Db 6013 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 6 072 

Qy 42 01 C^GTGGAC^TTATTTATGTTAAATATACT^TTATC^GCAAGTATGAAGTTATTCAATT 4260 

IIIIMIIIMIIIIIIIMIIIIMIIIIIIIMIMIIIIIIIIIIIIIIIMIMII 

Db 6073 CAAGTGGAC^TTATTTATGTTAAATATACAATTATC^GC^GTATGAAGTTATTCAATT 6132 

Qy 4261 AAAATGCCACATTTCTGGTCTCTGGG 4286 

1 1 1 I Mi 1 1 II I il 1 1 1 1 1 1 Ml 

Db 6133 AAAATGCCACATTTCTGGTCTCTGGG 6158 



RESULT 8 
ABK94410 

ID ABK94410 standard; DNA; 2972 BP. 
XX 

AC ABK94410; 
XX 

DT 27-AUG-2002 (first entry)
XX 

DE DNA encoding endothelin receptor B (EDNRB), exon 7.
XX 

KW Endothelin; EDN; endothelin converting enzyme; ECE; EDNRB; 

KW endothelin receptor B; signaling system; cardiovascular disease; 

KW coronary heart disease; hypertension; atherosclerosis; angiogenesis;

KW fatty acid metabolism; diabetes; familial hypercholesterolemia; 

KW forensic marker; transgenic animal; solid support; SNP; 

KW cardiovascular regulator; gene; ds; single nucleotide polymorphism. 

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT variation replace(1048,A)

FT /*tag= a 

FT /standard_name= "Single nucleotide polymorphism" 

FT variation replace(1658,C)

FT /*tag= b 

FT /standard_name= "Single nucleotide polymorphism" 

FT variation replace(1912,T)

FT /*tag= c 

FT /standard_name= "Single nucleotide polymorphism" 

FT variation replace(2130,T)

FT /*tag= d 

FT /standard_name= "Single nucleotide polymorphism" 



XX 

PN WO200224747-A2.
XX 

PD 28-MAR-2002. 
XX 

PF 31-AUG-2001; 2001WO-EP10087.
XX 

PR 19-SEP-2000; 2000EP-0120123.
XX 

PA (EPID-) EPIDAUROS BIOTECHNOLOGIE AG.
XX 

PI Brinkmann U, Hoffmeyer S; 
XX 

DR WPI; 2002-435060/46. 
XX 

PT Novel polynucleotide of the endothelin/endothelin converting 

PT enzyme/receptors of endothelin and endothelin converting enzyme 

PT signaling system associated with cardiovascular disease, useful for 

PT treating the disease 

XX 

PS Claim 1; Page -; 190pp; English. 
XX 

CC The invention describes a polynucleotide (I) of the endothelin 

CC (EDN)/endothelin converting enzyme (ECE)/receptors of EDN and ECE (EDNR)

CC signaling system which is associated with a cardiovascular disease. (I),

CC the gene encoding EDN, ECE or EDNR (II) or a vector (III) expressing (I) 

CC or (II) is useful for producing cells capable of expressing a molecular 

CC variant polypeptide which is associated with a cardiovascular disease. 

CC (II), (III), the EDN, ECE or EDNR polypeptide, or a cell expressing 

CC a molecular variant gene comprising (I) is useful for identifying and 

CC obtaining a pro-drug or drug capable of modulating the activity of a 

CC molecular variant of a polypeptide of the EDN/EDNR/ECE signaling system 

CC or its gene product, or for identifying and obtaining an inhibitor of 

CC the activity of a molecular variant of a polypeptide of the EDN/EDNR/ECE 

CC signaling system or its gene product. The isolated proteins and 

CC polynucleotides encoding them are useful for preparation of a 

CC pharmaceutical composition for treating a cardiovascular disease such as 

CC coronary heart disease, hypertension, atherosclerosis, or related to 

CC abnormal angiogenesis or fatty acid metabolism e.g. diabetes and familial 

CC hypercholesterolaemia. The gene or a polynucleotide fragment of the

CC EDN/ECE/EDNR signaling system are useful as forensic markers, for 

CC creating a transgenic animal and in creation of a solid support 

CC comprising polynucleotides, genes, vectors, polypeptides, antibodies or 

CC host cells of the invention. This sequence encodes a fragment of the 

CC cardiovascular regulator Endothelin receptor B (EDNRB).

CC Note: This sequence does not appear in the specification but has been 

CC obtained from GenBank using information given in the invention. 

XX 

SQ Sequence 2972 BP; 1018 A; 499 C; 465 G; 990 T; 0 other; 

Query Match 66.4%; Score 2857; DB 24; Length 2972; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2857; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1430 AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 1489
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      9 AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 68

Qy   1490 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 1549
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     69 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 128

Qy   1550 AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 1609
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    129 AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 188

Qy   1610 GAAGTCATTAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCA 1669
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    189 GAAGTCATTAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCA 248

Qy   1670 CAGCACACTATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATT 1729
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    249 CAGCACACTATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATT 308

Qy   1730 TTATGAGCTGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGA 1789
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    309 TTATGAGCTGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGA 368

Qy   1790 AAGCACTTAATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATT 1849
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    369 AAGCACTTAATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATT 428

Qy   1850 CACACAACACTTAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAG 1909
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    429 CACACAACACTTAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAG 488

Qy   1910 ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 1969
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    489 ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 548

Qy   1970 ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 2029
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    549 ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 608

Qy   2030 TCAATTAATATTATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTT 2089
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    609 TCAATTAATATTATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTT 668

Qy   2090 AGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAG 2149
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    669 AGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAG 728

Qy   2150 AGCAAGGCTGTTTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTG 2209
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    729 AGCAAGGCTGTTTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTG 788

Qy   2210 CAATATGTAACCAACATGTCACAAACAAGCAGCATGTAACAGAC 2269
          ||||||||||||||||||||||||||||||||||||||||||||
Db    789 CAATATGTAACCAACATGTCACAAACAAGCAGCATGTAACAGAC 848

Qy   2270 TGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAG 2329
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    849 TGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAG 908



Qy   2330 ATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAA 2389
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    909 ATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAA 968

Qy   2390 TCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAA 2449
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    969 TCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAA 1028

Qy   2450 ATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTT 2509
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1029 ATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTT 1088

Qy   2510 ACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTAC 2569
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1089 ACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTAC 1148

Qy   2570 GATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCAC 2629
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1149 GATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCAC 1208

Qy   2630 ATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGC 2689
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1209 ATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGC 1268

Qy   2690 ATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTA 2749
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1269 ATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTA 1328

Qy   2750 ACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAA 2809
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1329 ACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAA 1388

Qy   2810 AAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGT 2869
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1389 AAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGT 1448

Qy   2870 AAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCT 2929
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1449 AAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCT 1508

Qy   2930 TAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAG 2989
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1509 TAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAG 1568

Qy   2990 GAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTT 3049
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1569 GAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTT 1628

Qy   3050 CGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACA 3109
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1629 CGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACA 1688

Qy   3110 CAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAA 3169
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1689 CAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAA 1748

Qy   3170 AATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATA 3229
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1749 AATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATA 1808

Qy   3230 AGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAA 3289
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1809 AGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAA 1868

Qy   3290 CAATGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTT 3349
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1869 CAATGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTT 1928

Qy   3350 ATAAATCACCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTT 3409
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1929 ATAAATCACCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTT 1988

Qy   3410 ATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCA 3469
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1989 ATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCA 2048

Qy   3470 CAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACT 3529
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2049 CAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACT 2108

Qy   3530 GAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATC 3589
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2109 GAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATC 2168

Qy   3590 TTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCA 3649
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2169 TTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCA 2228

Qy   3650 TTCAGTGGCTTTTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAA 3709
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2229 TTCAGTGGCTTTTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAA 2288

Qy   3710 ACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACT 3769
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2289 ACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACT 2348

Qy   3770 TCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAA 3829
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2349 TCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAA 2408

Qy   3830 CATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTAT 3889
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2409 CATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTAT 2468

Qy   3890 CCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCC 3949
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2469 CCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCC 2528

Qy   3950 AAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAA 4009
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2529 AAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAA 2588

Qy   4010 TATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATA 4069
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2589 TATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATA 2648

Qy   4070 GTTACTGATTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTA 4129
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2649 GTTACTGATTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTA 2708

Qy   4130 CCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAA 4189
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2709 CCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAA 2768

Qy   4190 TATAAATGTGACAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAA 4249
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2769 TATAAATGTGACAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAA 2828

Qy   4250 GTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 4286
          |||||||||||||||||||||||||||||||||||||
Db   2829 GTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 2865



RESULT 9 
ABQ77402/C 

ID ABQ77402 standard; DNA; 183337 BP. 
XX 

AC ABQ77402;
XX 

DT 10-MAY-2003 (first entry) 
XX 

DE Human EDNRB DNA. 
XX 

KW Human; EDNRB; vascular disease; cardiant; antiarteriosclerotic; stroke; 

KW cerebroprotective; gene therapy; coronary artery disease; ischaemia; 

KW myocardial infarction; peripheral vascular disease; pulmonary embolism; 

KW venous thromboembolism; forensic; paternity testing; GI12597038; gene; 

KW SNP; single nucleotide polymorphism; ds.
XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT variation replace (75672, t) 

FT /*tag= a 

FT /standard_name= "SNP" 

FT /note= "Single nucleotide polymorphism (ID G337a4) 

FT which does not change the EDNRB protein"

XX 



PN WO2003016494-A2. 
XX 

PD 27-FEB-2003. 
XX 

PF 16-AUG-2002; 2002WO-US26343.
XX 

PR 16-AUG-2001; 2001US-313097P.
PR 05-OCT-2001; 2001US-327485P.
PR 14-DEC-2001; 2001US-0020141.
XX 

PA (VITI-) VITIVITY INC. 
XX 

PI McCarthy J, Ableson A; 



DR WPI; 2003-300617/29. 

DR P-PSDB; ABG74670. 
XX 

PT Identifying a subject as a candidate for a particular course of therapy 

PT to treat a vascular disease or disorder, e.g. stroke, myocardial 

PT infarction or ischemia by determining the identity of the nucleotide 

PT present at specific positions 

XX 

PS Claim 1; Fig 5; 568pp; English. 
XX 

CC This invention describes a novel method for identifying a subject as a 

CC candidate for a particular course of therapy to treat a vascular disease 

CC or disorder. The method comprises determining the identity of the 

CC nucleotide present at specific positions, or their complements, and 

CC identifying the subject as a candidate for a particular clinical course 

CC of therapy based on the identity of the nucleotide present in that 

CC specific position. The method can be used for identifying a subject who 

CC is a candidate for further diagnostic evaluation of a vascular disease or 

CC disorder and selecting a clinical course of therapy. The products of the 

CC invention have cardiant, antiarteriosclerotic and cerebroprotective

CC activity and can be used for gene therapy. The methods disclosed are 

CC useful for treating a vascular disease, e.g. atherosclerosis, coronary 

CC artery disease, myocardial infarction, ischaemia, stroke, peripheral 

CC vascular diseases, venous thromboembolism and pulmonary embolism. The DNA 

CC sequences are useful as fingerprint for detecting different individuals 

CC within the same species applicable in forensic studies and paternity 

CC testing. This sequence encodes the human EDNRB gene represented in

CC GI12597038, used to illustrate the method of the invention. 

XX 

SQ Sequence 183337 BP; 56451 A; 33595 C; 34663 G; 58628 T; 0 other; 

Query Match 66.1%; Score 2841.8; DB 25; Length 183337; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2854; Conservative 0; Mismatches 2; Indels 1; Gaps 1; 
Qy   1430 AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 1489
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72830 AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 72771

Qy   1490 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 1549
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72770 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 72711

Qy   1550 AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 1609
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72710 AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 72651

Qy   1610 GAAGTCATTAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCA 1669
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72650 GAAGTCATTAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCA 72591

Qy   1670 CAGCACACTATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATT 1729
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72590 CAGCACACTATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATT 72531

Qy   1730 TTATGAGCTGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGA 1789
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72530 TTATGAGCTGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGA 72471

Qy   1790 AAGCACTTAATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATT 1849
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72470 AAGCACTTAATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATT 72411

Qy   1850 CACACAACACTTAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAG 1909
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72410 CACACAACACTTAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAG 72351

Qy   1910 ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 1969
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72350 ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 72291

Qy   1970 ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 2029
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72290 ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 72231

Qy   2030 TCAATTAATATTATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTT 2089
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72230 TCAATTAATATTATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTT 72171

Qy   2090 AGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAG 2149
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72170 AGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAG 72111

Qy   2150 AGCAAGGCTGTTTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTG 2209
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  72110 AGCAAGGCTGTTTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTG 72051

Qy   2210 CAATATGTAACCAACATGTCACAAACAAGCAGCATGTAACAGAC 2269
          ||||||||||||||||||||||||||||||||||||||||||||
Db  72050 CAATATGTAACCAACATGTCACAAACAAGCAGCATGTAACAGAC 71991

Qy   2270 TGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAG 2329
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71990 TGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAG 71931

Qy   2330 ATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAA 2389
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71930 ATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAA 71871

Qy   2390 TCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAA 2449
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71870 TCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAA 71811

Qy   2450 ATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTT 2509
          |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71810 ATCTTCTTCTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTT 71751

Qy   2510 ACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTAC 2569
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71750 ACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTAC 71691

Qy   2570 GATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCAC 2629
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71690 GATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCAC 71631

Qy   2630 ATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGC 2689
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71630 ATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGC 71571

Qy   2690 ATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTA 2749
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71570 ATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTA 71511

Qy   2750 ACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAA 2809
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71510 ACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAA 71451

Qy   2810 AAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGT 2869
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71450 AAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGT 71391

Qy   2870 AAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCT 2929
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71390 AAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCT 71331

Qy   2930 TAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAG 2989
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71330 TAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAG 71271

Qy   2990 GAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTT 3049
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71270 GAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTT 71211

Qy   3050 CGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACA 3109
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71210 CGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACA 71151

Qy   3110 CAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAA 3169
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71150 CAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAA 71091

Qy   3170 AATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATA 3229
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71090 AATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATA 71031

Qy   3230 AGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAA 3289
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  71030 AGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAA 70971

Qy   3290 CAATGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTT 3349
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70970 CAATGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTT 70911

Qy   3350 ATAAATCACCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTT 3409
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70910 ATAAATCACCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTT 70851

Qy   3410 ATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCA 3469
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70850 ATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCA 70791

Qy   3470 CAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACT 3529
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70790 CAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACT 70731

Qy   3530 GAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATC 3589
          ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||||||
Db  70730 GAATTTTTACATCCTGATACCTTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATC 70671

Qy   3590 TTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCA 3649
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70670 TTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCA 70611

Qy   3650 TTCAGTGGCTTTTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAA 3709
          |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db  70610 TTCAGTGGCTTTTT-AAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAA 70552

Qy   3710 ACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACT 3769
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70551 ACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACT 70492

Qy   3770 TCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAA 3829
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70491 TCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAA 70432

Qy   3830 CATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTAT 3889
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70431 CATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTAT 70372

Qy   3890 CCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCC 3949
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70371 CCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCC 70312

Qy   3950 AAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAA 4009
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70311 AAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAA 70252

Qy   4010 TATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATA 4069
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70251 TATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATA 70192

Qy   4070 GTTACTGATTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTA 4129
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70191 GTTACTGATTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTA 70132

Qy   4130 CCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAA 4189
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70131 CCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAA 70072

Qy   4190 TATAAATGTGACAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAA 4249
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  70071 TATAAATGTGACAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAA 70012

Qy   4250 GTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 4286
          |||||||||||||||||||||||||||||||||||||
Db  70011 GTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 69975



RESULT 10 
AAV17875 

ID AAV17875 standard; cDNA to mRNA; 1873 BP. 
XX 

AC AAV17875; 
XX 

DT 03-AUG-1998 (first entry) 
XX 

DE Homo sapiens endothelin B receptor gene. 
XX 

KW dystroglycan; hDG; progressive muscular dystrophy; 

KW endothelin B receptor; alpha-sarcoglycan; Hirschsprung's disease; ds.
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 44..1204

FT /*tag= a 

FT /product= dystroglycan 

XX 

PN EP825265-A1. 
XX 

PD 25-FEB-1998. 
XX 

PF 14-AUG-1997; 97EP-0114019.
XX 

PR 16-AUG-1996; 96JP-0216506.
XX 

PA (RIKA ) INST PHYSICAL & CHEM RES. 

PA (RIKA ) RIKAGAKU KENKYUSHO. 

XX 

PI Hanaoka F, Sakamoto A; 
XX 

DR WPI; 1998-132258/13. 
XX 

PT cDNA reverse transcription and amplification - using different 

PT reverse transcription and 3' amplification primers 

XX 

PS Example 3; Page 19-21; 25pp; English. 
XX 

CC The sequence is that encoding human endothelin B receptor which is 

CC implicated in Hirschsprung's disease. 

XX 

SQ Sequence 1873 BP; 490 A; 434 C; 438 G; 511 T; 0 other; 

Query Match 39.3%; Score 1691.8; DB 19; Length 1873; 

Best Local Similarity 99.6%; Pred. No. 0; 

Matches 1696; Conservative 0; Mismatches 7; Indels 0; Gaps 0; 
Qy    178 TGAAACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 237
          ||   || ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    171 TGTCTCTAGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 230

Qy    238 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 297
          ||||||||||||||||||||||||||||  ||||||||||||||||||||||||||||||
Db    231 ATGCAGCCGCCTCCAAGTCTGTGCGGACCGGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 290

Qy    298 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 357
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    291 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 350

Qy    358 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 417
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    351 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 410

Qy    418 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 477
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    411 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 470

Qy    478 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 537
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    471 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 530

Qy    538 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 597
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    531 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 590

Qy    598 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 657
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    591 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 650

Qy    658 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 717
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    651 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 710

Qy    718 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 777
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    711 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 770

Qy    778 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 837
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    771 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 830

Qy    838 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 897
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    831 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 890

Qy    898 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 957
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    891 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 950

Qy    958 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1017
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    951 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1010

Qy   1018 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTC 1077
          |||||||||||||||||||||||||||||||||||||
Db   1011 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAA 1070

Qy   1078 TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1137
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1071 TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1130

Qy   1138 TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1197
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1131 TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1190

Qy   1198 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1257
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1191 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1250

Qy   1258 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1317
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1251 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1310

Qy   1318 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1377
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1311 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1370

Qy   1378 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1437
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1371 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1430

Qy   1438 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1497
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1431 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1490

Qy   1498 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1557
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1491 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1550

Qy   1558 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1617
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1551 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1610

Qy   1618 TAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACAC 1677
          |||||||||||||||||||||||||||||||||
Db   1611 TAAAACAAAATGAAACATTTGCCAAAACAAAAC 1670

Qy   1678 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1737
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1671 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1730

Qy   1738 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1797
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1731 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1790

Qy   1798 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1857
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1791 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1850

Qy   1858
Db   1851



RESULT 11 
AAF21283 

ID AAF21283 Standard; DNA; 1872 BP. 
XX 



AC AAF21283; 
XX 

DT 14-MAR-2001 (first entry) 
XX 

DE Human low adenosine antisense oligonucleotide related sequence #2850. 
XX 

KW Low adenosine antisense oligonucleotide; phosphorothioate; allergy; 

KW human; airway disorder; bronchoconstriction; lung inflammation; 

KW surfactant depletion; respiratory; bronchodilator; antiinflammatory;

KW immunosuppressive; antiasthmatic; analgesic; hypotensive; cytostatic; 

KW respiratory obstruction; pulmonary obstruction; impeded respiration; 

KW surfactant hypoproduction; pulmonary vasoconstriction; asthma; RDS; 

KW respiratory distress syndrome; pain; cystic fibrosis; allergic rhinitis; 

KW pulmonary hypertension; emphysema; pulmonary transplantation rejection; 

KW chronic obstructive pulmonary disease; pulmonary infection; bronchitis; 

KW cancer; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200062736-A2.
XX 

PD 26-OCT-2000. 
XX 

PF 24-MAR-2000; 2000WO-US08020.
XX 

PR 06-APR-1999; 99US-0127958.
XX 

PA (UYEC-) UNIV EAST CAROLINA. 

PA (NYCE/) NYCE J W. 

XX 

PI Nyce JW; 
XX 

DR WPI; 2000-679539/66. 
XX 

PT Low adenosine (A) content antisense oligonucleotides which do not 

PT trigger adenosine receptors during metabolism, useful e.g. for treating 

PT cancers and respiratory obstructions - 

XX 

PS Disclosure; Page 1273; 1592pp; English.
XX 

CC The present invention describes low adenosine (A) content antisense 

CC oligonucleotides and compositions (I) comprising them. In the antisense 

CC oligonucleotides the A is replaced by a 'Universal' or alternative base. 

CC (I) can have respiratory, bronchodilator, antiinflammatory, analgesic, 

CC immunosuppressive, antiasthmatic, hypotensive and cytostatic activities. 

CC The antisense oligonucleotides and (I) can be used to down-regulate the 

CC expression and/or activity of target polypeptides associated with

CC lung/respiratory disorders and malignancies, such as stimulating and 

CC activating peptide factors and transmitters, transcription factors, 

CC immunoglobulins and antibodies, antibody receptors, cytokines and 

CC chemokines, endogenously produced specific and non-specific enzymes, 

CC binding proteins, adhesion molecules and their receptors, cytokine and 

CC chemokine receptors, adenosine receptors, bradykinin receptors, central 

CC nervous system (CNS) and peripheral nervous and non-nervous system 

CC receptors, CNS and peripheral nervous and non-nervous system peptide 

CC transmitters, defensins, growth factors, vasoactive peptides and 

CC receptors, binding proteins and malignancy associated proteins. The 



CC antisense oligonucleotides may be used in this way to treat disorders 

CC including respiratory obstruction (especially pulmonary obstruction 

CC and/or bronchoconstriction) and/or lung inflammation, allergy(ies)

CC and/or surfactant hypoproduction which are associated with a disease or 

CC condition selected from pulmonary vasoconstriction, inflammation, 

CC allergies, asthma, impeded respiration, respiratory distress syndrome 

CC (RDS), pain, cystic fibrosis (CF), allergic rhinitis (AR), pulmonary

CC hypertension, emphysema, chronic obstructive pulmonary disease (COPD),

CC pulmonary transplantation rejection, pulmonary infections, bronchitis, 

CC and/or cancer. AAF18434 to AAF21543 represent human polynucleotide 

CC fragments and antisense oligonucleotides used in the exemplification of 

CC the present invention. 

XX 

SQ Sequence 1872 BP; 489 A; 434 C; 438 G; 511 T; 0 other; 

Query Match 39.3%; Score 1690.8; DB 21; Length 1872; 

Best Local Similarity 99.6%; Pred. No. 0; 

Matches 1695; Conservative 0; Mismatches 7; Indels 0; Gaps 0; 

Qy  178 TGAAACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 237
        ||   || ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  171 TGTCTCTAGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 230

Qy  238 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 297
        ||||||||||||||||||||||||||||  ||||||||||||||||||||||||||||||
Db  231 ATGCAGCCGCCTCCAAGTCTGTGCGGACCGGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 290

Qy  298 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 357
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  291 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 350

Qy  358 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 417
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  351 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 410

Qy  418 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 477
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  411 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 470

Qy  478 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 537
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  471 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 530

Qy  538 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 597
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  531 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 590

Qy  598 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 657
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  591 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 650

Qy  658 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 717
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  651 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 710

Qy  718 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 777
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  711 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 770

Qy  778 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 837
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  771 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 830

Qy  838 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 897
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  831 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 890

Qy  898 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 957
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  891 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 950

Qy  958 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1017
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  951 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1010

Qy 1018 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTC 1077
        |||||||||||||||||||||||||||||||||||||||||||||||||| |||||||||
Db 1011 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTATTCAGTTTC 1070

Qy 1078 TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1137
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1071 TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1130

Qy 1138 TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1197
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1131 TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1190

Qy 1198 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1257
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1191 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1250

Qy 1258 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1317
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1251 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1310

Qy 1318 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1377
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1311 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1370

Qy 1378 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1437
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1371 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1430

Qy 1438 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1497
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1431 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1490

Qy 1498 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1557
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1491 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1550

Qy 1558 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1617
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1551 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1610

Qy 1618 TAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACAC 1677
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1611 TAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACAC 1670

Qy 1678 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1737
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1671 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1730

Qy 1738 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1797
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1731 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1790

Qy 1798 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1857
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1791 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1850

Qy 1858 ACTTAGGCTTAAAAATGAGCTC 1879
        ||||||||||||||||||||||
Db 1851 ACTTAGGCTTAAAAATGAGCTC 1872



RESULT 12 
AAA35161 

ID AAA35161 standard; DNA; 1872 BP. 
XX 

AC AAA35161; 
XX 

DT 28-JUL-2000 (first entry) 
XX 

DE Human adenosine receptor related polynucleotide 2nd SEQ ID NO: 35. 
XX 

KW Human; adenosine receptor; low adenosine antisense oligonucleotide; 

KW phosphorothioate; impaired respiration; inflammation; allergy; 

KW allergic disease; bronchoconstriction; inhibitor; antiinflammatory; 

KW antiallergic; antiasthmatic; cytostatic; analgesic; impaired airway; 

KW lung disease; ischaemic condition; pulmonary vasoconstriction; asthma; 

KW respiratory distress syndrome; pain; cystic fibrosis; emphysema; 

KW pulmonary hypertension; chronic obstructive pulmonary disease; COPD; 

KW cancer; leukaemia; lymphoma; carcinoma; metastasis; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200009525-A2. 
XX 

PD 24-FEB-2000. 
XX 

PF 03-AUG-1999; 99WO-US17712. 
XX 

PR 03-AUG-1998; 98US-0095212. 
XX 

PA (UYEC-) UNIV EAST CAROLINA. 
XX 

PI Nyce JW; 
XX 

DR WPI; 2000-205971/18. 
XX 



PT New antisense oligonucleotides useful for treating e.g. pulmonary 

PT vasoconstriction, inflammation, allergies, asthma, hypertension, 

PT bronchitis, emphysema, respiratory distress syndrome, ischemia or 

PT cancers 
XX 

PS Disclosure; Page 1190-1191; 1343pp; English. 
XX 

CC The present invention describes a new composition comprising an 

CC antisense oligonucleotide (ON) with low adenosine (up to 15%), which 

CC targets nucleic acids involved in bronchoconstriction, allergies, and/or 

CC inflammation. The ON can have antiinflammatory, antiallergic, 

CC antiasthmatic, cytostatic and analgesic activities. The compositions are 

CC useful for the treatment of diseases associated with inflammation, 

CC impaired airways, including lung disease and diseases whose secondary 

CC effects afflict the lungs of a subject. They can be used for treating 

CC e.g. ischaemic conditions, pulmonary vasoconstriction, allergies, 

CC asthma, impeded respiration, respiratory distress syndrome, pain, cystic 

CC fibrosis, pulmonary hypertension, emphysema, chronic obstructive 

CC pulmonary disease (COPD), and cancers such as leukaemias, lymphomas, 

CC carcinomas, and cancers which may metastasise to the lungs, including 

CC breast and prostate cancer. The reduction of the adenosine content of 

CC the ONs reduces side effects. The A-containing ONs break down with the 

CC release of deoxyadenosine which activates adenosine receptors causing 

CC bronchoconstriction and inflammation. AAA32313 to AAA35312 represent the 

CC nucleotide sequences given in the sequence listing from the present 

CC invention, which correspond to SEQ ID NO:1 to 2815, and then the last 

CC 185 sequences are also called SEQ ID NO:1 to 185, but the sequences 

CC differ from the previously named sequences. SEQ ID NO: 11 to 1680 

CC (AAA32323 to AAA33992) are specifically claimed ONs from the present 

CC invention. N.B. Sequences given in the disclosure of the present 

CC invention do not match up with their corresponding SEQ ID NO: sequences 

CC given in the sequence listing. 

XX 

SQ Sequence 1872 BP; 489 A; 434 C; 438 G; 511 T; 0 other; 

Query Match 39.3%; Score 1690.8; DB 21; Length 1872; 

Best Local Similarity 99.6%; Pred. No. 0; 

Matches 1695; Conservative 0; Mismatches 7; Indels 0; Gaps 0; 

Qy  178 TGAAACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 237
        ||   || ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  171 TGTCTCTAGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 230

Qy  238 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 297
        ||||||||||||||||||||||||||||  ||||||||||||||||||||||||||||||
Db  231 ATGCAGCCGCCTCCAAGTCTGTGCGGACCGGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 290

Qy  298 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 357
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  291 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 350

Qy  358 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 417
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  351 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 410

Qy  418 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 477
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  411 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 470

Qy  478 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 537
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  471 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 530

Qy  538 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 597
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  531 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 590

Qy  598 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 657
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  591 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 650

Qy  658 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 717
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  651 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 710

Qy  718 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 777
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  711 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 770

Qy  778 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 837
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  771 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 830

Qy  838 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 897
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  831 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 890

Qy  898 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 957
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  891 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 950

Qy  958 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1017
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  951 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1010

Qy 1018 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTC 1077
        |||||||||||||||||||||||||||||||||||||||||||||||||| |||||||||
Db 1011 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTATTCAGTTTC 1070

Qy 1078 TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1137
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1071 TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1130

Qy 1138 TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1197
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1131 TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1190

Qy 1198 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1257
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1191 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1250

Qy 1258 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1317
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1251 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1310

Qy 1318 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1377
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1311 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1370

Qy 1378 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1437
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1371 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1430

Qy 1438 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1497
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1431 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1490

Qy 1498 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1557
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1491 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1550

Qy 1558 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1617
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1551 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1610

Qy 1618 TAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACAC 1677
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1611 TAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACAC 1670

Qy 1678 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1737
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1671 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1730

Qy 1738 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1797
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1731 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1790

Qy 1798 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1857
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1791 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1850

Qy 1858 ACTTAGGCTTAAAAATGAGCTC 1879
        ||||||||||||||||||||||
Db 1851 ACTTAGGCTTAAAAATGAGCTC 1872



RESULT 13 
AAF21285 

ID AAF21285 standard; DNA; 1719 BP. 
XX 

AC AAF21285; 
XX 

DT 14-MAR-2001 (first entry) 
XX 

DE Human low adenosine antisense oligonucleotide related sequence #2852. 
XX 

KW Low adenosine antisense oligonucleotide; phosphorothioate; allergy; 

KW human; airway disorder; bronchoconstriction; lung inflammation; 

KW surfactant depletion; respiratory; bronchodilator; antiinflammatory; 

KW immunosuppressive; antiasthmatic; analgesic; hypotensive; cytostatic; 

KW respiratory obstruction; pulmonary obstruction; impeded respiration; 



KW surfactant hypoproduction; pulmonary vasoconstriction; asthma; RDS; 

KW respiratory distress syndrome; pain; cystic fibrosis; allergic rhinitis; 

KW pulmonary hypertension; emphysema; pulmonary transplantation rejection; 

KW chronic obstructive pulmonary disease; pulmonary infection; bronchitis; 

KW cancer; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200062736-A2. 
XX 

PD 26-OCT-2000. 
XX 

PF 24-MAR-2000; 2000WO-US08020. 
XX 

PR 06-APR-1999; 99US-0127958. 
XX 

PA (UYEC-) UNIV EAST CAROLINA. 

PA (NYCE/) NYCE J W. 

XX 

PI Nyce JW; 
XX 

DR WPI; 2000-679539/66. 
XX 

PT Low adenosine (A) content antisense oligonucleotides which do not 

PT trigger adenosine receptors during metabolism, useful e.g. for treating 

PT cancers and respiratory obstructions - 

XX 

PS Disclosure; Page 1274-1275; 1592pp; English. 
XX 

CC The present invention describes low adenosine (A) content antisense 

CC oligonucleotides and compositions (I) comprising them. In the antisense 

CC oligonucleotides the A is replaced by a 'Universal' or alternative base. 

CC (I) can have respiratory, bronchodilator, antiinflammatory, analgesic, 

CC immunosuppressive, antiasthmatic, hypotensive and cytostatic activities. 

CC The antisense oligonucleotides and (I) can be used to down-regulate the 

CC expression and/or activity of target polypeptides associated with 

CC lung/respiratory disorders and malignancies, such as stimulating and 

CC activating peptide factors and transmitters, transcription factors, 

CC immunoglobulins and antibodies, antibody receptors, cytokines and 

CC chemokines, endogenously produced specific and non-specific enzymes, 

CC binding proteins, adhesion molecules and their receptors, cytokine and 

CC chemokine receptors, adenosine receptors, bradykinin receptors, central 

CC nervous system (CNS) and peripheral nervous and non-nervous system 

CC receptors, CNS and peripheral nervous and non-nervous system peptide 

CC transmitters, defensins, growth factors, vasoactive peptides and 

CC receptors, binding proteins and malignancy associated proteins. The 

CC antisense oligonucleotides may be used in this way to treat disorders 

CC including respiratory obstruction (especially pulmonary obstruction 

CC and/or bronchoconstriction) and/or lung inflammation, allergy(ies) 

CC and/or surfactant hypoproduction which are associated with a disease or 

CC condition selected from pulmonary vasoconstriction, inflammation, 

CC allergies, asthma, impeded respiration, respiratory distress syndrome 

CC (RDS), pain, cystic fibrosis (CF), allergic rhinitis (AR), pulmonary 

CC hypertension, emphysema, chronic obstructive pulmonary disease (COPD), 

CC pulmonary transplantation rejection, pulmonary infections, bronchitis, 

CC and/or cancer. AAF18434 to AAF21543 represent human polynucleotide 

CC fragments and antisense oligonucleotides used in the exemplification of 



CC the present invention. 
XX 

SQ Sequence 1719 BP; 444 A; 400 C; 412 G; 463 T; 0 other; 

Query Match 34.8%; Score 1495.4; DB 21; Length 1719; 

Best Local Similarity 99.8%; Pred. No. 2.5e-303; 

Matches 1503; Conservative 0; Mismatches 2; Indels 1; Gaps 1; 

Qy  203 GCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCCAAGTCTGTGCG 262
        |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db  200 GCGGCCACCGGACG-CTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCCAAGTCTGTGCG 258

Qy  263 GACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTGGGGAGAGGAGA 322
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  259 GACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTGGGGAGAGGAGA 318

Qy  323 GAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAAACCGCAGAGATAATGACGCCAC 382
        ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db  319 GAGGCTTCCCGCCCGACAGGGCCACTCCGCTTTTGCAAACCGCAGAGATAATGACGCCAC 378

Qy  383 CCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGGTCGTTGGCACCTG 442
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  379 CCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGGTCGTTGGCACCTG 438

Qy  443 CGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACCATCTCCCCTCCCC 502
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  439 CGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACCATCTCCCCTCCCC 498

Qy  503 CGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCT 562
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  499 CGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCT 558

Qy  563 GCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGA 622
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  559 GCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGA 618

Qy  623 ACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGC 682
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  619 ACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGC 678

Qy  683 TGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCAT 742
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  679 TGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCAT 738

Qy  743 TTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTG 802
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  739 TTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTG 798

Qy  803 TGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAA 862
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  799 TGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAA 858

Qy  863 TTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCT 922
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  859 TTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCT 918

Qy  923 CTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGATGGACTACAAAG 982
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  919 CTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGATGGACTACAAAG 978

Qy  983 GAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGTTTT 1042
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  979 GAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGTTTT 1038

Qy 1043 ACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCA 1102
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1039 ACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCA 1098

Qy 1103 CTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGA 1162
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1099 CTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGA 1158

Qy 1163 TTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGTCTTTTGCCTGG 1222
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1159 TTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGTCTTTTGCCTGG 1218

Qy 1223 TCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTC 1282
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1219 TCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTC 1278

Qy 1283 TTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACT 1342
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1279 TTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACT 1338

Qy 1343 ATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGG 1402
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1339 ATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGG 1398

Qy 1403 TGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTG 1462
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1399 TGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTG 1458

Qy 1463 AAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACG 1522
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1459 AAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACG 1518

Qy 1523 GATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAGAAGAACTATTCAC 1582
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1519 GATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAGAAGAACTATTCAC 1578

Qy 1583 TGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATGAAACATTTGCCAA 1642
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1579 TGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATGAAACATTTGCCAA 1638

Qy 1643 AACAAAACAAAAAACTATGTATTTGCACAGCACACTATTAAAATATTAAGTGTAATTATT 1702
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1639 AACAAAACAAAAAACTATGTATTTGCACAGCACACTATTAAAATATTAAGTGTAATTATT 1698

Qy 1703 TTAACA 1708
        |||| |
Db 1699 TTAAAA 1704





RESULT 14 



AAA35163 

ID AAA35163 standard; DNA; 1720 BP. 
XX 

AC AAA35163; 
XX 

DT 28-JUL-2000 (first entry) 
XX 

DE Human adenosine receptor related polynucleotide 2nd SEQ ID NO: 37. 
XX 

KW Human; adenosine receptor; low adenosine antisense oligonucleotide; 

KW phosphorothioate; impaired respiration; inflammation; allergy; 

KW allergic disease; bronchoconstriction; inhibitor; antiinflammatory; 

KW antiallergic; antiasthmatic; cytostatic; analgesic; impaired airway; 

KW lung disease; ischaemic condition; pulmonary vasoconstriction; asthma; 

KW respiratory distress syndrome; pain; cystic fibrosis; emphysema; 

KW pulmonary hypertension; chronic obstructive pulmonary disease; COPD; 

KW cancer; leukaemia; lymphoma; carcinoma; metastasis; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200009525-A2. 
XX 

PD 24-FEB-2000. 
XX 

PF 03-AUG-1999; 99WO-US17712. 
XX 

PR 03-AUG-1998; 98US-0095212. 
XX 

PA (UYEC-) UNIV EAST CAROLINA. 
XX 

PI Nyce JW; 
XX 

DR WPI; 2000-205971/18. 
XX 

PT New antisense oligonucleotides useful for treating e.g. pulmonary 

PT vasoconstriction, inflammation, allergies, asthma, hypertension, 

PT bronchitis, emphysema, respiratory distress syndrome, ischemia or 

PT cancers 
XX 

PS Disclosure; Page 1192; 1343pp; English. 
XX 

CC The present invention describes a new composition comprising an 

CC antisense oligonucleotide (ON) with low adenosine (up to 15%), which 

CC targets nucleic acids involved in bronchoconstriction, allergies, and/or 

CC inflammation. The ON can have antiinflammatory, antiallergic, 

CC antiasthmatic, cytostatic and analgesic activities. The compositions are 

CC useful for the treatment of diseases associated with inflammation, 

CC impaired airways, including lung disease and diseases whose secondary 

CC effects afflict the lungs of a subject. They can be used for treating 

CC e.g. ischaemic conditions, pulmonary vasoconstriction, allergies, 

CC asthma, impeded respiration, respiratory distress syndrome, pain, cystic 

CC fibrosis, pulmonary hypertension, emphysema, chronic obstructive 

CC pulmonary disease (COPD), and cancers such as leukaemias, lymphomas, 

CC carcinomas, and cancers which may metastasise to the lungs, including 

CC breast and prostate cancer. The reduction of the adenosine content of 

CC the ONs reduces side effects. The A-containing ONs break down with the 

CC release of deoxyadenosine which activates adenosine receptors causing 



CC bronchoconstriction and inflammation. AAA32313 to AAA35312 represent the 

CC nucleotide sequences given in the sequence listing from the present 

CC invention, which correspond to SEQ ID NO:1 to 2815, and then the last 

CC 185 sequences are also called SEQ ID NO:1 to 185, but the sequences 

CC differ from the previously named sequences. SEQ ID NO: 11 to 1680 

CC (AAA32323 to AAA33992) are specifically claimed ONs from the present 

CC invention. N.B. Sequences given in the disclosure of the present 

CC invention do not match up with their corresponding SEQ ID NO: sequences 

CC given in the sequence listing. 

XX 

SQ Sequence 1720 BP; 445 A; 400 C; 412 G; 463 T; 0 other; 

Query Match 34.8%; Score 1495.4; DB 21; Length 1720; 

Best Local Similarity 99.8%; Pred. No. 2.5e-303; 

Matches 1503; Conservative 0; Mismatches 2; Indels 1; Gaps 1; 
Qy    203 GCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCCAAGTCTGTGCG 262
          |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db    200 GCGGCCACCGGACG-CTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCCAAGTCTGTGCG 258

Qy    263 GACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTGGGGAGAGGAGA 322
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    259 GACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTGGGGAGAGGAGA 318

Qy    323 GAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAAACCGCAGAGATAATGACGCCAC 382
          ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db    319 GAGGCTTCCCGCCCGACAGGGCCACTCCGCTTTTGCAAACCGCAGAGATAATGACGCCAC 378

Qy    383 CCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGGTCGTTGGCACCTG 442
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    379 CCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGGTCGTTGGCACCTG 438

Qy    443 CGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACCATCTCCCCTCCCC 502
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    439 CGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACCATCTCCCCTCCCC 498

Qy    503 CGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCT 562
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    499 CGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCT 558

Qy    563 GCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGA 622
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    559 GCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGA 618

Qy    623 ACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGC 682
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    619 ACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGC 678

Qy    683 TGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCAT 742
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    679 TGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCAT 738

Qy    743 TTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTG 802
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    739 TTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTG 798

Qy    803 TGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAA 862
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    799 TGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAA 858

Qy    863 TTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCT 922
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    859 TTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCT 918

Qy    923 CTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGATGGACTACAAAG 982
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    919 CTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGATGGACTACAAAG 978

Qy    983 GAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGTTTT 1042
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    979 GAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGTTTT 1038

Qy   1043 ACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCA 1102
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1039 ACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCA 1098

Qy   1103 CTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGA 1162
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1099 CTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGA 1158

Qy   1163 TTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGTCTTTTGCCTGG 1222
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1159 TTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGTCTTTTGCCTGG 1218

Qy   1223 TCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTC 1282
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1219 TCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTC 1278

Qy   1283 TTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACT 1342
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1279 TTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACT 1338

Qy   1343 ATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGG 1402
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1339 ATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGG 1398

Qy   1403 TGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTG 1462
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1399 TGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTG 1458

Qy   1463 AAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACG 1522
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1459 AAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACG 1518

Qy   1523 GATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAGAAGAACTATTCAC 1582
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1519 GATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAGAAGAACTATTCAC 1578

Qy   1583 TGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATGAAACATTTGCCAA 1642
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1579 TGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATGAAACATTTGCCAA 1638

Qy   1643 AACAAAACAAAAAACTATGTATTTGCACAGCACACTATTAAAATATTAAGTGTAATTATT 1702
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1639 AACAAAACAAAAAACTATGTATTTGCACAGCACACTATTAAAATATTAAGTGTAATTATT 1698

Qy   1703 TTAACA 1708
          |||| |
Db   1699 TTAAAA 1704
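
The statistics reported for this hit can be cross-checked against the alignment coordinates. The sketch below is only a consistency check on the printed figures; it assumes (the report does not say so) that "Best Local Similarity" is the number of matches divided by the number of alignment columns. The helper name `span` is introduced here for illustration.

```python
def span(start: int, end: int) -> int:
    """Number of residues covered by an inclusive coordinate range."""
    return end - start + 1

# Figures copied from the statistics lines of this result.
matches, mismatches, indels = 1503, 2, 1
columns = matches + mismatches + indels   # total alignment columns

query_span = span(203, 1708)   # Qy coordinates of the alignment
db_span = span(200, 1704)      # Db coordinates of the alignment

# The single indel is the gap shown in the Db line, so Qy fills every
# column and Db fills all but one.
assert query_span == columns
assert db_span == columns - indels

# Assumed definition: similarity = matches / alignment columns.
similarity = round(100.0 * matches / columns, 1)
print(columns, similarity)
```

Under that assumption the computed similarity agrees with the 99.8% printed for this result.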

RESULT 15 
ACA56605 

ID ACA56605 standard; cDNA; 1470 BP. 
XX 

AC ACA56605; 
XX 

DT 06-JUN-2003 (first entry) 
XX 

DE Human signalling pathway polynucleotide probe SEQ ID NO 1203. 
XX 

KW Human; probe; ss; array element; Parkinson's disease; 

KW signalling pathway population; cancer; adenocarcinoma; leukaemia; 

KW immunopathy; AIDS; asthma; neuropathy; Alzheimer's disease; microarray. 

XX 

OS Homo sapiens. 
XX 

PN US6500938-B1. 
XX 

PD 31-DEC-2002. 
XX 

PF 30-JAN-1998; 98US-0016434. 
XX 

PR 30-JAN-1998; 98US-0016434. 
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Au-Young J, Seilhamer JJ; 
XX 

DR WPI; 2003-352189/33. 
XX 

PT Combination of polynucleotide probes, useful as array elements in a 

PT microarray for monitoring the expression of a number of target 

PT polynucleotides 
XX 

PS Claim 1; SEQ ID NO 1203; 65pp; English. 
XX 

CC The invention relates to a combination which comprises a number of 

CC polynucleotide probes comprising a sequence selected from one of the 1490 

CC sequences mentioned in the specification. The combination is useful as an 

CC array element in a microarray for monitoring the expression of a number 

CC of target polynucleotides. The microarray is particularly useful in the 

CC diagnosis and treatment of cancer and immunopathology and neuropathology. 

CC The microarray is useful in diagnostics and treatment regimens, drug 

CC discovery and development, toxicological and carcinogenicity studies, 

CC forensics and pharmacogenomics. The microarray is also useful for 

CC monitoring progression of diseases and for developing sophisticated 

CC profiles for the effects of currently available therapeutic drugs. The 

CC combination is also useful for purifying a subpopulation of mRNAs, cDNAs 

CC and genomic fragments and in research and diagnostic applications. The 

CC array can detect changes in expression in a large number of genes coding 



CC for different signaling pathway populations which can be used to diagnose 

CC various diseases including cancer e.g. adenocarcinoma and leukaemia, 

CC immunopathies e.g. AIDS and asthma, neuropathies e.g. Alzheimer's disease 

CC and Parkinson's disease. The present sequence represents a polynucleotide 

CC probe of the invention. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification but was obtained in electronic format directly from USPTO 

CC at seqdata.uspto.gov/sequence.html?DocID=06500938B1. 
XX 

SQ Sequence 1470 BP; 381 A; 343 C; 350 G; 396 T; 0 other; 

Query Match 34.1%; Score 1466.8; DB 25; Length 1470; 

Best Local Similarity 99.9%; Pred. No. 2.3e-297; 

Matches 1468; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy    192 GAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCC 251
          |||||||||||  |||||||||||||||||||||||||||||||||||||||||||||||
Db      1 GAAACTGCGGACGGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCC 60

Qy    252 AAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTG 311
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 AAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTG 120

Qy    312 GGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAAACCGCAGAGAT 371
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAAACCGCAGAGAT 180

Qy    372 AATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGGTC 431
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 AATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGGTC 240

Qy    432 GTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACCAT 491
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACCAT 300

Qy    492 CTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACAC 551
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACAC 360

Qy    552 GGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAAT 611
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 GGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAAT 420

Qy    612 TATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCT 671
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 TATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCT 480

Qy    672 GGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGA 731
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 GGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGA 540

Qy    732 GGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGT 791
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 GGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGT 600

Qy    792 GGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTC 851
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 GGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTC 660

Qy    852 TTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGAT 911
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 TTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGAT 720

Qy    912 TTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGAT 971
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 TTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGAT 780

Qy    972 GGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTT 1031
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTT 840

Qy   1032 CATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCC 1091
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 CATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCC 900

Qy   1092 ATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAG 1151
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 ATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAG 960

Qy   1152 TGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGT 1211
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 TGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGT 1020

Qy   1212 CTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCT 1271
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 CTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCT 1080

Qy   1272 GAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTT 1331
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 GAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTT 1140

Qy   1332 GGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGC 1391
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 GGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGC 1200

Qy   1392 TCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTG 1451
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTG 1260

Qy   1452 CCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGC 1511
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 CCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGC 1320

Qy   1512 TAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAGAA 1571
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 TAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAGAA 1380

Qy   1572 GAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATGAA 1631
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 GAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATGAA 1440

Qy   1632 ACATTTGCCAAAACAAAACAAAAAACTATG 1661
          ||||||||||||||||||||||||||||||
Db   1441 ACATTTGCCAAAACAAAACAAAAAACTATG 1470



Search completed: December 12, 2003, 11:27:19 
Job time : 1006.18 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



December 12, 2003, 09:16:57 ; Search time 15090.3 Seconds 

(without alignments) 
11659.930 Million cell updates/sec 



Title:          US-09-931-157-2
Perfect score:  4301
Sequence:       1 gagacattccggtgggggac ctgggaaaaaaaaaaaaaaa 4301

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0



Searched:       2888711 seqs, 20454813386 residues

Total number of hits satisfying chosen parameters:      5777422

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
               10:  gb_ro:*
               11:  gb_sts:*
               12:  gb_sy:*
               13:  gb_un:*
               14:  gb_vi:*
               15:  em_ba:*
               16:  em_fun:*
               17:  em_hum:*
               18:  em_in:*
               19:  em_mu:*
               20:  em_om:*
               21:  em_or:*
               22:  em_ov:*
               23:  em_pat:*
               24:  em_ph:*
               25:  em_pl:*
               26:  em_ro:*
               27:  em_sts:*
               28:  em_un:*
               29:  em_vi:*
               30:  em_htg_hum:*
               31:  em_htg_inv:*
               32:  em_htg_other:*
               33:  em_htg_mus:*
               34:  em_htg_pln:*
               35:  em_htg_rod:*
               36:  em_htg_mam:*
               37:  em_htg_vrt:*
               38:  em_sy:*
               39:  em_htgo_hum:*
               40:  em_htgo_mus:*
               41:  em_htgo_other:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result           Query
  No.    Score   Match  Length  DB  ID         Description

    1     4301   100.0    4301   6  AR177880   AR177880 Sequence
    2     4301   100.0    4301   6  E07650     E07650 cDNA encodi
    3     4286    99.7    4286   9  HUMETR     D90402 Homo sapien
    4   4284.4    99.6    4286   6  AX548828   AX548828 Sequence
    5   4284.4    99.6    4286   6  AX587707   AX587707 Sequence
    6   4284.4    99.6    4286   9  S57283     S57283 Homo sapien
    7     2857    66.4    2972   9  D13162S7   D13168 Homo sapien
c   8   2841.8    66.1  183337   9  AL139002   AL139002 Human DNA
c   9   2792.4    64.9  201093   2  AC144750   AC144750 Pan trogl
   10     2610    60.7    2720  11  G06417     G06417 human STS W
c  11     2550    59.3  169751   2  AC130785   AC130785 Papio anu
c  12     2550    59.3  185870   2  AC129069   AC129069 Papio anu
   13   1691.8    39.3    1873   6  AR165435   AR165435 Sequence
   14   1691.8    39.3    1873   6  E15242     E15242 Human mRNA
   15   1690.8    39.3    1872   9  S44866     S44866 ETB endothe
   16   1495.4    34.8    1719   9  HUMEDNRB   L06623 Homo sapien
   17   1466.8    34.1    1470   6  AR270640   AR270640 Sequence
   18   1466.8    34.1    1470   9  HUMETSR    M74921 Human endot
   19     1389    32.3    1603   9  BC014472   BC014472 Homo sapi
   20     1389    32.3    1632   6  AX342673   AX342673 Sequence
   21   1361.4    31.7    1765   9  AF114165   AF114165 Homo sapi
   22   1327.4    30.9    1329   9  AY275463   AY275463 Homo sapi
   23   1322.6    30.8    1329   6  AX280873   AX280873 Sequence
   24   1222.8    28.4    1669   4  AF019072   AF019072 Equus cab
   25   1220.4    28.4    1578   9  HSX99250   X99250 H.sapiens m
   26   1197.8    27.8    2026   4  BOVEETBR   D90456 Bos taurus
   27     1186    27.6    1452   4  AF034530   AF034530 Canis fam
   28     1113    25.9    2018  10  S65355     S65355 nonselectiv
   29   1110.2    25.8    1551   6  E05930     E05930 DNA sequenc
   30   1104.8    25.7    2115  10  BC026553   BC026553 Mus muscu
   31   1099.2    25.6    1958   6  AX305434   AX305434 Sequence
   32   1099.2    25.6    1958  10  MMU32329   U32329 Mus musculu
   33     1091    25.4    1892  10  RNETBREC   X57764 Rat mRNA fo
   34     1091    25.4    1965   6  E03623     E03623 DNA encodin
   35   1086.6    25.3    1311   4  AF038900   AF038900 Equus cab
   36   1070.4    24.9    1321   6  AR207426   AR207426 Sequence
   37   1067.6    24.8    1314   4  AF276427   AF276427 Canis fam
   38   1042.8    24.2    1326   4  AF245469   AF245469 Oryctolag
c  39    931.8    21.7  135327   2  AC118537
c  40    931.8    21.7  170121   2  AC123546   AC123546 Felis cat
c  41    922.2    21.4  192330   2  AC122157   AC122157 Canis fam
   42      746    17.3    1564   5  AF472616   AF472616 Gallus ga
   43    732.6    17.0    1041   5  CCEDNRB    X99295 C.coturnix
   44      588    13.7     588  11  G15922     G15922 human STS C
   45    564.8    13.1    1520   5  AF275636   AF275636 Danio rer
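
The Query Match figures in the summaries above are consistent with the score expressed as a percentage of the perfect score (4301 for this query). The sketch below illustrates that relationship; it is inferred from the listed numbers rather than stated by the report, and the function name `query_match_percent` is introduced here for illustration.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect score, to one decimal place."""
    return round(100.0 * score / perfect_score, 1)

PERFECT_SCORE = 4301  # "Perfect score" from the run header

# (result no., score, Query Match as listed in the summaries)
rows = [(1, 4301.0, 100.0), (3, 4286.0, 99.7), (16, 1495.4, 34.8), (45, 564.8, 13.1)]

for no, score, listed in rows:
    # Print the computed percentage next to the listed one for comparison.
    print(no, query_match_percent(score, PERFECT_SCORE), listed)
```

For the rows sampled here the computed and listed percentages coincide.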



ALIGNMENTS 



RESULT 1 
AR177880 
LOCUS       AR177880     4301 bp    DNA     linear   PAT 17-DEC-2001
DEFINITION  Sequence 3 from patent US 6313276.
ACCESSION   AR177880
VERSION     AR177880.1  GI:17920235
KEYWORDS
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1  (bases 1 to 4301)
  AUTHORS   Imura,H., Nakao,K. and Nakanishi,S.
  TITLE     Human endothelin receptor
  JOURNAL   Patent: US 6313276-A 3 06-NOV-2001;
FEATURES             Location/Qualifiers
     source          1..4301
                     /organism="unknown"
BASE COUNT     1342 a    828 c    817 g   1314 t
ORIGIN

Query Match 100.0%; Score 4301; DB 6; Length 4301; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4301; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy      1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy     61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy    121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy    181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240



Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 



CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300 

IMIMM M I III II I III II IM IIIM II II I Ml MM 

CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 3 00 



MMI I I II M I I II IM I M M IIIM I I! 



ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420 

I II II I M, II II I I Ml II M I MIMI M I Ml I 

ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420 
CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480 

I II III I MMI IM IS I MUM M I MUM IM I III I 

CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 4 80 
CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 54 0 

MIIIIMIIIMIIIilMIIIIIIIMIMIIIIIIIIIIIIIIIIIIMMMIIII 

CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 54 0 



IMIIIIMIII IIIMIIIIIIMIIIIII IIILIIIIII! MM Mill 



CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 

Ml II I I Ml II I I II IM MM II III I I II I 

CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 
AGCTTGGCTCTGGGAGACCTGCTGCAC^TCGTCATTGACATCCCTATCAATGTCTACAAG 72 0 

III II I ! II II I I II II I I II III I I M I 

AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720 
CTG CTGGCAGAGGACTGGC CATTTGGAG CTGAGATGTGTAAG CTGGTGC CTTTCATACAG 780 

M . ' i I M M M I i ; I M Ml MIML^M 

CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 78 0 
AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 84 0 

Ml II I I II M I I II II MMMM II. I I II l 

AAAG C CTC CGTGGGAAT CACTGTG CTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 84 0 



MMMI M MlMM I I MIMMM I IMIIMMM MM 



M M I I I II II M M II II MM; II II I MMI I 

ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 
ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCX3TTCAG 1020 

M III I M IMIIMM M I Ml IMM. I Ml II M:M MM 

ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 102 0 
AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 108 0 

M IM I M, IMIM M M IMIMMmMMMMMIMI IIMM 

AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080 
1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 



Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 



I I II 1 1 1 1 1 1. 1 1 1 1 M Ml 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 II I II I II I II 



1140 



AGAAAGAAAAGTGGCATG CAGATTGCTTTAAATGATCACCTAAAG CAGAGACGGGAAGTG 12 00 

IMIMIIIMIIMIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIMM 

AGAAAGAAAAGTGG CATG CAGATTG CTTTAAATGATCACCTAAAG CAGAGACGGGAAGTG 12 00 
GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 

MM : ' I I :'i , h M I i I I 1 MM I. 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 
AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 132 0 

1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 

AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320 
AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 

Ml II III II II M ill II M Mill II Mill! MMIIMM I 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 
AACC CAATTGCTCTGTATTTGGTGAG CAAAAGATTCAAAAACTG CTTTAAGTCATG CTTA 144 0 

IIIIIIIMIIIIIIIIIMMIIMIIIMIIIMIIIIIMIIIIIIIIIIIIIIIII 

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 
TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

IIIIMMMMIIIIIIIMIMMIIIIMIIMIIIIMIIIIIIIIIIIIIMIII 

TGCTG CTGGTG CCAGTCATTTGAAGAAAAACAGT CCTTGGAGGAAAAG CAGTCGTG CTTA 1500 
AAGTTCAAAGCTAATGATCACGGATATGAC^CTTCCGTTCCAGTAATAAATACAGCTCA 1560 

MIMIM MMIIMM MMIIMM III llllllllll 1111111111111111111 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 
TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620 

II Mi I Ml MMI MMII I Ml M II II Ml I MMM I 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620 



minim Milium iiiiiiiiii iii mm urn i ii ii in Milium ii 



TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 174 0 

I IIIIIIIIII III MM III MMII MM MMM |||| II MM III II 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMMMIIMIMIMMMIMIMIII 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 
TTTTTACAGTTAGC^CTTCAACATAGCTCTTAACAACTTCCAGGATATTC^ 

IIIIIIIIIIIMIIIMIIIIMIIIIIMIIIIIIIIIMIIMIIIMIIIIIIIII 

TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATT^ 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 

IIIIIIIIIMIIIIIIIMIimilMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

TAGGCTTAAAAATGAGCTC^CTC^GAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 
AATC^TGGGACTCTGATATAAAGGAAGAATAAGTC^CTGTAAAACAGAACTTTTAAATG 

II I M II II M IMM, MMI M II II I IIMMM I 



Db 



1921 AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 



Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 




Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 



AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 

MIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 204 0 
TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100 

IIMIIIIMIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100 
TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAG CAAGG CTGT 2160 

I II M MM M Mill M IM II IM I El M, II I M 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 
TTTTGAAAATCATTACACT^ 222 0 

IIIIIIIIMIIIIMIMIIIIIIIIIMIMIIIIIIMIIIIIIIIIIMIIMIII 

TTTTGAAAATCATTACACTTTC^CTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 222 0 
CAACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTA 2280 

iiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiii 

CAACATGTCACAAACAAGCAGCATGTAACAGACTC 22 80 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 

M II II MM M II I MMI II IM Mill IM II II ill 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 
CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

I I II M IM ll IMM MIMM IIMM I II MMIMIl M 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 0 0 



IMIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



2460 



TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2 52 0 

I I MM IMM I II I II II Ml II M Ml II II II III 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2 520 
CTG CATGTAGATGATTAAATGAGGG CAGGCCCTGTG CTCATAG CTTTACGATGGAGAGAT 258 0 

IMIIIIIMIIIIIIMIIIIIMIIIIIIIIMMMIIIIIIIIMIIIIIIIIIII 

CTG CATGTAGATGATTAAATGAGGG CAGG C CCTGTG CT CATAG CTTTACGATGGAGAGAT 258 0 
GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACA 2 64 0 

MM Ml I Ml I II I II I Ml IM IMIMM II il Ml 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 264 0 
GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

II M I MMIIMII II Mi l Mill: MMI I MM II I 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 
GCTATAGTTAAAATACTATTTTTCAAAATC^TACAGATTAGTACATTTAACAGCTACCTG 2760 

M II IMM MM I II I II M II Ml lllllllll Ml M II 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 
TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820 

I : i M 'I i I '. ^ 'I " III ' Il M I 

TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 282 0 



Qy  2821 ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2821 ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880

Qy  2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940

Qy  2941 GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2941 GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000

Qy  3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060

Qy  3061 CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 3120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3061 CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 3120

Qy  3121 GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3121 GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180

Qy  3181 ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3181 ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240

Qy  3241 TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3241 TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300

Qy  3301 GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3301 GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360

Qy  3361 ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3361 ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420

Qy  3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480

Qy  3481 TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3481 TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540

Qy  3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600

Qy  3601 TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3601 TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660



Qy  3661 TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3661 TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720

Qy  3721 TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3721 TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780

Qy  3781 TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3840
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3781 TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3840

Qy  3841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900

Qy  3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960

Qy  3961 GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3961 GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020

Qy  4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080

Qy  4081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 4140
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  4081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 4140

Qy  4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200

Qy  4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260

Qy  4261 AAAATGCCACATTTCTGGTCTCTGGGAAAAAAAAAAAAAAA 4301
         |||||||||||||||||||||||||||||||||||||||||
Db  4261 AAAATGCCACATTTCTGGTCTCTGGGAAAAAAAAAAAAAAA 4301



RESULT 2
E07650
LOCUS       E07650       4301 bp    RNA     linear   PAT 29-SEP-1997
DEFINITION  cDNA encoding endothelin receptor, ETB-receptor.
ACCESSION   E07650
VERSION     E07650.1  GI:2175785
KEYWORDS    JP 1994157595-A/2.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4301)
  AUTHORS   Imura,H., Nakao,I. and Nakanishi,S.
  TITLE     HUMAN ENDOTHELIN RECEPTOR
  JOURNAL   Patent: JP 1994157595-A 2 03-JUN-1994;
            SHIONOGI & CO LTD
COMMENT     OS   Homo sapiens (human)
            PN   JP 1994157595-A/2
            PD   03-JUN-1994
            PF   12-JUL-1991 JP 1991172828
            PI   IMURA HIROO, NAKAO ICHIKAZU, NAKANISHI SHIGETADA
            PC   C07K13/00,C12N5/10,C12N15/12,C12P21/02,(C12N5/10,C12R1:91)(C12P21/02,
            PC   C12R1:91);
            CC   strandedness: Double;
            CC   topology: Linear;
            CC   hypothetical: No;
            CC   anti-sense: No;
            FH   Key             Location/Qualifiers
            FH
            FT   source          1..4301
            FT                   /organism='Homo sapiens'
            FT                   /tissue_type='placenta'
            FT                   /clone='pHETBR31 and pHTBR34'
            FT   5'UTR           1..237
            FT   CDS             238..1566
            FT                   /product='endothelin receptor, ETB-receptor'
            FT   3'UTR           1567..4301
            FT   polyA_signal    4258..4263
            FT   polyA_signal    3638..3643
            FT   polyA_signal    3134..3139
            FT   polyA_signal    2594..2599
            FT   polyA_signal    1689..1694.
FEATURES             Location/Qualifiers
     source          1..4301
                     /organism="Homo sapiens"
                     /mol_type="genomic RNA"
                     /db_xref="taxon:9606"
BASE COUNT     1342 a    828 c    817 g   1314 t
ORIGIN

  Query Match             100.0%;  Score 4301;  DB 6;  Length 4301;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches 4301;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy     1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy    61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy   121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy   181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240



Qy   241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy   301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy   361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

Qy   421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

Qy   481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

Qy   541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

Qy   601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

Qy   661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

Qy   721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

Qy   781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

Qy   841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

Qy   901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960

Qy   961
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   961

Qy  1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080



Qy  1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140

Qy  1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200

Qy  1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260

Qy  1261 AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1261 AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320

Qy  1321 AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1321 AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380

Qy  1381 AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1381 AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440

Qy  1441 TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1441 TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500

Qy  1501 AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1501 AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560

Qy  1561 TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1561 TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620

Qy  1621 AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCA 1680
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1621 AACAAAATGAAACATTTGCCAAAACAAAACAAAAAAC 1680

Qy  1681 TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1681 TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740

Qy  1741 TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1741 TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800

Qy  1801 TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1801 TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860

Qy  1861 TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1861 TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920



Qy  1921 AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1921 AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980

Qy  1981 AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1981 AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040

Qy  2041 TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2041 TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100

Qy  2101 TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2101 TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160

Qy  2161 TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2161 TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220

Qy  2221 CAACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAA 2280
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2221 CAACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAA 2280

Qy  2281 TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 2340
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2281 TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 2340

Qy  2341 CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 2400
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2341 CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 2400

Qy  2401 CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2401 CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460

Qy  2461 TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2461 TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520

Qy  2521 CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2521 CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580

Qy  2581 GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2581 GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640

Qy  2641 GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2641 GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700

Qy  2701 GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2701 GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760

Qy  2761 TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820



         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2761 TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820

Qy  2821 ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2821 ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880

Qy  2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2881 AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2940

Qy  2941 GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2941 GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000

Qy  3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3001 GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060

Qy  3061 CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 3120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3061 CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 3120

Qy  3121 GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3121 GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180

Qy  3181 ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3181 ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240

Qy  3241 TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3241 TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300

Qy  3301 GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3301 GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360

Qy  3361 ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3361 ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420

Qy  3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 3480

Qy  3481 TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3481 TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 3540

Qy  3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600

Qy  3601 TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3601 TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660



Qy  3661 TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3661 TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720

Qy  3721 TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3721 TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780

Qy  3781 TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3840
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3781 TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3840

Qy  3841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3841 CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900

Qy  3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960

Qy  3961 GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  3961 GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020

Qy  4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080

Qy  4081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 4140
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  4081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 4140

Qy  4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200

Qy  4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260

Qy  4261 AAAATGCCACATTTCTGGTCTCTGGGAAAAAAAAAAAAAAA 4301
         |||||||||||||||||||||||||||||||||||||||||
Db  4261 AAAATGCCACATTTCTGGTCTCTGGGAAAAAAAAAAAAAAA 4301



RESULT 3
HUMETR
LOCUS       HUMETR       4286 bp    mRNA    linear   PRI 18-DEC-2002
DEFINITION  Homo sapiens ETR mRNA for endothelin receptor, complete cds.
ACCESSION   D90402
VERSION     D90402.1  GI:219651
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4286)
  AUTHORS   Ogawa,Y., Nakao,K., Arai,H., Nakagawa,O., Hosoda,K., Suga,S.,
            Nakanishi,S. and Imura,H.
  TITLE     Molecular cloning of a non-isopeptide-selective human endothelin
            receptor
  JOURNAL   Biochem. Biophys. Res. Commun. 178 (1), 248-255 (1991)
  MEDLINE   91298956
   PUBMED   1648908
COMMENT     Data kindly submitted in computer readable form by: Kazuwa Nakao
            Second Division, Department of Medicine
            Kyoto University School of Medicine
            54 Shogoin Kawahara-cho
            Sakyo-ku, Kyoto 606
            Japan
            Phone:  81-75-751-3170
            Fax:    81-75-771-9452.
FEATURES             Location/Qualifiers
     source          1..4286
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     gene            1..4286
                     /gene="ETR"
     CDS             238..1566
                     /gene="ETR"
                     /codon_start=1
                     /product="endothelin receptor"
                     /protein_id="BAA14398.1"
                     /db_xref="GI:219652"
                     /translation="MQPPPSLCGRALVALVLACGLSRIWGEERGFPPDRATPLLQTAE
                     IMTPPTKTLWPKGSNASLARSLAPAEVPKGDRTAGSPPRTISPPPCQGPIEIKETFKY
                     INTVVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIVIDIPINVY
                     KLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWT
                     AVEIVLIWVVSVVLAVPEAIGFDIITMDYKGSYLRICLLHPVQKTAFMQFYKTAKDWW
                     LFSFYFCLPLAITAFFYTLMTCEMLRKKSGMQIALNDHLKQRREVAKTVFCLVLVFAL
                     CWLPLHLSRILKLTLYNQNDPNRCELLSFLLVLDYIGINMASLNSCINPIALYLVSKR
                     FKNCFKSCLCCWCQSFEEKQSLEEKQSCLKFKANDHGYDNFRSSNKYSSS"
     polyA_site      4286
                     /gene="ETR"
BASE COUNT     1327 a    828 c    817 g   1314 t
ORIGIN

  Query Match             99.7%;  Score 4286;  DB 9;  Length 4286;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches 4286;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy     1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy    61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy   121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180



Qy   181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Qy   241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy   301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy   361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

Qy   421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

Qy   481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

Qy   541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

Qy   601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

Qy   661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

Qy   721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

Qy   781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

Qy   841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

Qy   901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960

Qy   961
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   961



Qy  1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080

Qy  1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140

Qy  1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200

Qy  1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260

Qy  1261 AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1261 AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320

Qy  1321 AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1321 AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380

Qy  1381 AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1381 AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 1440

Qy  1441 TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1441 TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500

Qy  1501 AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1501 AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560

Qy  1561 TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1561 TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620

Qy  1621 AACAAAATGAAACATTTGCCAAAACA 1680
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1621 AACAAAATGAAACATTTGCCAAAACAAAAC 1680

Qy  1681 TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1681 TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 1740

Qy  1741 TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1741 TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800

Qy  1801 TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1801 TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 1860



Qy  1861 TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1861 TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920

Qy  1921 AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1921 AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980

Qy  1981 AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1981 AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2040

Qy  2041 TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2041 TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100

Qy  2101 TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2101 TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160

Qy  2161 TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2161 TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220

Qy  2221 CAACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAA 2280
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2221 CAACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAA 2280

Qy  2281 TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 2340
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2281 TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 2340

Qy  2341 CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 2400
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2341 CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 2400

Qy  2401 CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2401 CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460

Qy  2461 TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2461 TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520

Qy  2521 CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2521 CTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGAT 2580

Qy  2581 GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2581 GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2640

Qy  2641 GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  2641 GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700

Qy  2701 GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760



Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 



1 1 1 1 1 ! 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 II II I ! 1 1 II 1 1 1 1 1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 

GCTATAGTTAAAATACTATTTTTOW^TCATACAGATTAGTACATTTAACAGCTACCTG 2760 
TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 282 0 

MIMM I! II II I I IMIIMIM IIIIIMI III I II I 

TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 282 0 
ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 

MMMIMII II M IMMIMM! MIMI MM II I 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 294 0 

: ! i i ■ M M M I I I M . ii , I , , ; 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 294 0 
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

IIIMIIMIIIIIIMIiMIIIMIMIIIIIMMIMIIIIIIIIIIIIMIIIII 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3 000 
GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 

IIIIIIIIIMIIIMMMIMMIIMIIIIIIIIIIMMIMIIIIMMMMM 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 
CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 312 0 

MIMI 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 i I i 1 1 1 M 1 1 1 [ II ! 1 1 1 1 1 1 1 1 

CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 312 0 
GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 318 0 

: i > I l ' 1 : 1 1 < III 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 318 0 
ATTTTTACTTTGTTTTTCTTTTAATAGG CTGGG CCACATGTTGGAAATAAGCTAGTAATG 324 0 

1 1 1 1 1 1 1 1 1 1 J 1 1 1 f 1 1 1 1 1 J 1 1 1 f 1 1 1 j 1 1 1 1 1 f 1 1 1 r j 1 1 1 1 r i j 1 1 1 r 1 1 i e 1 1 1 1 j 

ATTTTTACTTTGTTTTTCTTTTAATAGG CTGGG CCACATGTTGGAAATAAGCTAGTAATG 324 0 
TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACC 33 00 

; : i ! i I - : I I : I 1 1 ^ . I ! - ! ! I 

TTGTTTTCTGTC^ATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 33 00 
GAAAGAAAGAGCAATAATAATTAATTCAC^C^CCATATGGATTCTATTTATAAATCACCC 3360 

MIMIMMMMIM I Ml MIMI II II IMM I MM 

GAAAGAAAGAGCAATAATAATTAATT CACACACCATATGGATT CTATTTATAAATCACCC 3360 



M MM I M M IMI II IMIMM (MM Ml MM 



CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 34 8 0 

MM MIMMIMI I Mill I M MMIIMM MM 

CATTTTAGACTCTCAA.TTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATT^ 34 80 
TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 

MMI IMMIMI M I MM I MM MM IMI II 

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 



MIMIMI IMI M MMMIMI MIMM I IMM IM 



Db 



3541 TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 



Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 


Db 


4021 


Qy 


4081 


Db 


4081 


Qy 


4141 


Db 


4 141 


Qy 


4201 


Db 


4201 


Qy 


4261 


Db 


4261 



TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 

IIIIMIIIIMIIIMIIIIIMIIMIIIMIIIIIIIIIIIIIIIIIMIIIIIIII 

TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 
TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 372 0 

IIIMIIMIIIIMIIIIIIMIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIMIII 

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720 
TTCTTTACA.TACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTC^CTTCAAAACATGT 3780 

1 1 1 1 ' II 1 1 1 1 1 1 1 1 1 1 Ml I llllll hi II I II II 1 1 1 1 1 1 II 1 1 1 II II 'MM 

TTCTTTAC^TACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTC^CTTC^AAACATGT 3780 
TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 384 0 

I IMMM IMII II MMM I II IM M II I III! I Ml 

TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3 84 0 
CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 

I II III IMII II I MM M II IM M II I MM I II 

CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 
GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 

1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 1 1 1 ! 1 1 1 II 1 1 1 1 1 ! 1 1 1 It I E M 1 1 II 1 1 

GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 
GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 

I I MI MIMM II I M II II MM II III I MMM II 

GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 
AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 080 

IMMMI MIMM IMII II II II MM MMM I IM I Ml 

AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 08 0 
TTTATTATGTAAG CAAAAC CAATAAAAATTTAAGTTTTTTTAACAACTAC CTTATTTTTC 414 0 

M Ml II II II II M II M II II M Ml I IM I III 

TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 
ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 

I II IM IMII III MMMMIMM II M II MMIMMIMI 

ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 
CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260 

M IMIMMIMI Ml II II IMIMMI II IIMIIMI MMM 

CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGC^AGTATGAAGTTATTCAATT 4260 

AAAATGCCACATTTCTGGTCTCTGGG 4286 

||||||||||||||||||||||||||
AAAATGCCACATTTCTGGTCTCTGGG 4286



RESULT 4 
AX548828 

LOCUS AX548828 4286 bp DNA linear PAT 26-NOV-2002 

DEFINITION Sequence 113 from Patent WO02061087. 

ACCESSION AX548828 

VERSION AX548828.1 GI: 25813723 



KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Burmer,G.C., Roush,C.L. and Brown,J.P.

TITLE Antigenic peptides, such as for G protein-coupled receptors
(GPCRs), antibodies thereto, and systems for identifying such
antigenic peptides
JOURNAL Patent: WO 02061087-A 113 08-AUG-2002; 
Lifespan Biosciences, Inc. (US) 
FEATURES Location/Qualifiers 
source 1..4286
/organism="Homo sapiens"
/mol_type="genomic DNA"
/db_xref="taxon:9606"
BASE COUNT 1327 a 829 c 816 g 1314 t

ORIGIN 

Query Match 99.6%; Score 4284.4; DB 6; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy 61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy 181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

       |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||

Db 181 AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Qy 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

Qy 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480



Qy 481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540


Qy 541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600


Qy 601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660


Qy 661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720


Qy 721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780


Qy 781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840


Qy 841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 841 GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900


Qy 901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 901 ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960


Qy 961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 961 ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020


Qy 1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1021 AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080


Qy 1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1081 TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140


Qy 1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1141 AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200


Qy 1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1201 GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260


Qy 1261 AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1261 AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320


Qy 1321 AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2 041 


Ov 




Db 


2101 


Qy 


2161 


Db 


2161 



1321 AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 138 0 
AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 

MIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIII 

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 
TG CTG CTGGTG C CAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTG CTTA 1500 

I .MM . ;i M I M'l 1 M I Mi 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 15 00 
AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 

IMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 
TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 162 0 

I Ml I Ml II I , II I MIMI M Mill M II III I 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 162 0 
AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGT^ 168 0 

M IM I I MMiM I II I I! II I II II I Mill M 

AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGC^ 168 0 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 

IIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIII 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 
TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 

M M M Ml IM I II IIMI IMM IMIMMMM II IMM 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 
TTTTTACAGTTAGCACTTCAACATAGCTCTTAAC^CTTCCAGGATATTCACACAACACT 1860 

MIMIIIMIIIIIIIIIIIIMIIIIIIIMIMIIIIIIMMIIIIIIIIIIMM 

TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAACACT 18 60 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

M II I Ml IMM II II I II IM MMI MMM M M M 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 1920 
AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 

MMIIIIIIIIIIIIIIIIIIIMIIMIMIIMIIIIIIIIMIIIIIIIIIIIIII 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 204 0 

IMM M Ml II I II IMM IIMI IMIMMMM II I M 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 204 0 
TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100 

IMIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIMIMIIII 



IIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIMIMIMI 



TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 2220 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIMI 

TTTTGAAAATCATTAC^CTTTCACTAGAAGCCCAAACCTCAGCATTCTGCAATATGTAAC 222 0 



Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 



CAACATGTCACAAACAAG 228 0 

IIIIIIIIIMIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CAACATGTCACAAACAAGC 2280 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 

Mi III I III II II M III III II II MMII I II MMI 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

I MiMI M Ml Mill II, Mill ilIMM I II II IM I Ml 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 



M I III M IM I Ml M II II MIMM I II I MM I II 



TCA.CTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 

M I III IMIIMI IM I III II II II III MMI Ml MMI 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 
CTGCATGTAGATGATTAAATGAGGG CAGG CCCTGTG CTCATAG CTTTACGATGGAGAGAT 2580 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMI 

CTGCATGTAGATGATTAAATGAGGG CAGG CCCTGTG CT CATAG CTTTACGATGGAGAGAT 2580 
GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 264 0 

IIIIIIIIIIIIIIMIIIMIIIIIIIIIMIIIMIIIIIMIIIIIIIIIIIIIMI 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2 64 0 
GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 27 00 

I Ml IM Mill I MMI Ml II II MMM MMI Ml Mill 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 27 00 
GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 

IIIIIMIIIIIIMIMIIIIIIIIIIIMIIMIIIMIIIIIIIIIMMIIIIMI 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 
TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAG CCAATAGAAAAGTTTGCTTG 282 0 

M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 

TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 282 0 
ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 

IIIIIIMIIIIIIMIMIIIIIIIIMMIIMIIIIIIIIIIIIIIIIIIIIIIIII 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 288 0 
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 

MMI M M M II I M I Ml I II II Ml II MM 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

n'MI Mi II M IM I II M IMM Ml IM IIMI MMM 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 



MMI Ml II MMI I II MMI I II I Ml II MMI 



Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


■ 3721 




J / ol 


Db 


3781 


Qy 


3841 


Db 


3841 



CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCA 3120 

1 1 1 1 J 1 1 1 1 1 ! 1 1 1 1 1 1 ; 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 r j i r J 1 1 1 1 r 1 1 1 1 1 1 

CGTCACATC^TGCAAAAGGTCCTGATTTTGTTCCAGCAA^CACAGTGCAATGTTCTCA 3120 



IMIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIMIIMIIIIIIMIIIIII 



ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 3240 

MM II II I I! l! I Ml II IM I MM.M II IIMM I 

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 
TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 33 00 

I IMIMM M I I M II II II M II M II II II II M 

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 33 00 



M M II M: I M I I M MMI M I I IMIMM M M 

GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCI 
ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAG' 

II II II Ml IM! M I II IMM M M I II IMM II M 

ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAG r 
CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAj 

I II M I I II I I l! ;iM I I I II ll'll II I 



TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 

M II M MMM! M M M IMM MM I IMIMM II I 

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 



MM II I I I I I II || | | M M Ml II IIMM I 



TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 

M Mill 1 h I Ml MMI II MM II li:il lllllllll I 

TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 



MMIIIIIIIIIIIIIIMIIIIIMIMIMIIIIIIIIIIIIIIIIMIIIIIII 



TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780 

l : M II I I I M I II II I I M il MIMI II II I 

TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 378 0 



IIIIIIIIIMMIIIIIMIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 
CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 

MMI M M : I I M II M II I M M II II II IM M | 



Qy 3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3901 GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960

Qy 3961 GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 3961 GTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGT 4020

Qy 4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 4021 AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4080

Qy 4081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 4140

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 4081 TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 4140

Qy 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 4141 ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4200

Qy 4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 4201 CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260

Qy 4261 AAAATGCCACATTTCTGGTCTCTGGG 4286

        ||||||||||||||||||||||||||

Db 4261 AAAATGCCACATTTCTGGTCTCTGGG 4286



RESULT 5 
AX587707 

LOCUS AX587707 4286 bp DNA linear PAT 10-JAN-2003 

DEFINITION Sequence 177 from Patent WO0246467. 
ACCESSION AX587707 

VERSION AX587707.1 GI: 28212378 

KEYWORDS 

SOURCE synthetic construct 

ORGANISM synthetic construct 

artificial sequences.
REFERENCE 1 

AUTHORS Bertucci,F., Houlgatte,R., Birnbaum,D., Nguyen,C., Viens,P. and
Fert,V.

TITLE Gene expression profiling of primary breast carcinomas using arrays
of candidate genes
JOURNAL Patent: WO 0246467-A 177 13-JUN-2002; 
Ipsogen (FR) 
FEATURES Location/Qualifiers
source 1..4286
/organism="synthetic construct"
/db_xref="taxon:32630"
/note="primer"
misc_feature 1..4286
/note="endothelin receptor type b (EDNRB) gene."
BASE COUNT 1327 a 829 c 816 g 1314 t 

ORIGIN 



Query Match 99.6%; Score 4284.4; DB 6; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 



Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60

Qy 61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 61 AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 120

Qy 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 121 AGGATCAACACAGTGGCTGAACACTGGGAAGGAACTGGTACTTGGAGTCTGGACATCTGA 180

Qy 181 AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

       |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||

Db 181 AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 240

Qy 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 241 CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300

Qy 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 301 TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360

Qy 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 361 ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 420

Qy 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 421 CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 480

Qy 481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 481 CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 540

Qy 541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 541 TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600

Qy 601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 601 CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660

Qy 661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 661 AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720

Qy 721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 721 CTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAG 780

Qy 781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 781 AAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGA 840


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 



GCTGTTG CTT CTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900 

IIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMI 

GCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAA 900 
ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 

I II 1 1 1 1 M i 1 1 II M M 1 1 1 IN 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 M 1 1 i I i M 1 1 1 1 1 1 1 

ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 
ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 

ATAATTACGATGGACTACAAAGGAAGTTATCTG CGAAT CTG CTTG CTTCATCCCGTTCAG 1020 
AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 108 0 

II : I . 1 . 1 1 : ll : l II II s I II : 1 1 ! [J II: ll ll l 

AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080 
TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 1140 

IMIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIMMIIMIIIIIIIIIIIIIIII 

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 
AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 12 00 

Milll IMIIMIMI II: II Ml II III I! II III Ml I 

AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200 
GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 12 60 

IMMM I Ml MM I III II MM I Ml II MMM 111 IM 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 
AGCAGGATT CTGAAG CTCACTCTTTATAATCAGAATGATCC CAATAGATGTGAACTTTTG 1320 

II III I Ml II I IM II MM I II IM I MM IM I 

AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 1320 
AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 

IIIIIIIIIIIIIIIIIIIIIIIMIIMIMMIIIIIIIIIIIIMIIIIIIMIIII 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 
AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTC^AAAACTGCTTTAAGTCATGCTTA 144 0 

M II ;M IM II II Ml MM II M IM II IM II II II 

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 
TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

II MM I MMM I IM II II II Ml IM MMM III II 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 
AAGTT CAAAG CTAATGAT CACGGATATGACAACTT CCGTTC CAGTAATAAATACAG CTCA 1560 

Ml II Mi ll, IMIIMM MMMM M IMIIMM Ml IM M 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 



MIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMMMIIIII 



AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACA 1680 

M II I II IM M II II M,M II IM I MM III M 

AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATC 1680 



Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 



TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 174 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 174 0 
TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 

MM I! MMM II IIMIMM II Ml II II M II MM 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 
TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAAC^ 1860 

M M IM MM! II! M MMM Ml MUM Mill Ml Ml II 

TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTT^ 1860 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

M I MMMI IM III I M M III IMIMMI M IM MMM 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 
AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 

IIIMIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIMI 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 

M I II IM I Ml I II M II Ml M IM I Ml II II 

AAGCTTAAATTACTC^TTTAAAATTTTAAAATCCTTTAAAACAACTTTTC^ATTAATAT 2 04 0 
TATCACACTATTATCAGATTGTAATTAGATGCA^ 2100 

M M MMM I M I II IMI Ml I MMI IMMIMI 

TATCACACTATTATCAGATTGTAATTAGATG CAAATGAGAGAGCAGTTTAGTTGTTG CAT 2100 
TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIMIIIIMIMIIIMMIIIII 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 
TTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGC^TTCTGCAATATGTAAC 2220 

' : " i I " ■ ■ II i : ' i II; 1 ! ! i ' " . i 

TTTTGAAAATC^TTACACTTTCACTAGAAGCCC^ 222 0 

CAACATGTCACAAACAAGCAGCATGTAACAGACT 2280 

IIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CAACATGTCACAAACAAGCAGCATGTAACAGACTGGC^ 2280 
TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 

M II IMMM I III I IMIII II Ml | Ml | ML || | 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTAC^TTCAGTTAAGATCAAACCTCA 234 0 
CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

II III II IM I MMM MMMI I Ml Mi II. Mill II I 

C^\AAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATT(^V 24 00 
CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 24 6 0 

M Ml I M M IhMUM IMMUI M Ml Ml IMUIM 

CATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTT 2460 
TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 252 0 

MMIIIIIIIMIIIIIIMIMIIMIIMIIMIMIIIMIIIIIMIMIIMM 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2520 
2521 CTGCATGTAGATGATTAAATGAGGGCAGGCC CTGTGCT CATAG CTTTACGATGGAGAGAT 2 58 0 



Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 



I ill I ,1 !i ^ I I : , lh II < 

CTGCATGTAGATGATTAAATGAGGG CAGG CCCTGTG CTCATAGCTTTACGATGGAGAGAT 2 580 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2 64 0 

III lllllllllll llllll MINI IIIIIMIMI llllllllllllllll 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2 64 0 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

IMIIIIIIMMIIIMIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIII 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2 760 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIMIIIIIIIIMIIIIIII 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 

TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAGC CAATAGAAAAGTTTG CTTG 2 82 0 

IMIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIMIIIIIII 

TAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAGC CAATAGAAAAGTTTG CTTG 2 82 0 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2 880 

i 1 1 1 . i i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 i;mi iimi 1 1 1 1 1 1 1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 1. 1 1 1 1 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 

M M . ,1 II i I !. : ,; I ■ I I M I ! : I 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3 000 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 ' 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 ! I 1 1 1 1 1 M 1 1 1 1 1 1 1 II 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 

: i . : : 'I - . . i I . i ; : . i 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 

CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAA 312 0 

I ! I i I II. i I , ; I I 

CGTCACATCAATGCAAAAGGTCCTGATTTTGT 3120 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 3180 

IIIIIIMMIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMII 

GAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAA 318 0 

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 

M I Ml I Ml M II. I MM! II IM I II II II I I 

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCC^ 3300 

IIMMIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIMIIMIMI 

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCC^ 3300 

GAAAGAAAGAG CAATAATAATTAATTCACACAC CATATGGATTCTATTTATAAATCACCC 33 60 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIMIIIIIIIIIIIII 

GAAAGAAAGAG CAATAATAATTAATTCACACAC CATATGGATTCTATTTATAAAT 3360 

ACAAACTTGTTCTTTAATTTCA.TCCCAATC^CTTTTTC^GAGGCCTGTTATCATAGAAGT 3420 

M IM I M M Ml ! M M, II Ml I Ml I Ml M M 



Db 



3361 ACAAACTTGTTCTTTAA.TTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 3420 



Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 


Db 


4021 


Qy 


4081 


Db


4081


Qy 


4141 


Db 


4141 


Qy 


4201 


Db 


4201 



CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 348 0 

1 1 1 ! 1 1 1 I : 1 1 1 1 1 II 1 : 1 1 1 ' 1 1 1 1 1 1 1 l I II I 

CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 348 0 
TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 

II: II I III II' Ihll II I ill IIMI I MIMI II 

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 
TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 

III II I Ml II MMI II i Ml IM II I MM II M 

TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 
TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3 660 

MIMM I Ml MIMIIMI II M III IMIIII I MM MMM 

TGAAACTACACA.CAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 
TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720 

II IM M IMMI M IM II II MMM II II II MM I 

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720 
TTCTTTACATACTCAAAACCAAGATAGAAAAA 378 0 

II II M IM M M IM II II MMM II M II Ml M 

TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAAQVTGT 378 0 
TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 384 0 

II II Ml II II M, II MMIM II IM II II IMMI MM 

TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 3 84 0 
CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900 

I II IM II II M II Ml Ml II II II M IM II M 

CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3900 
GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960 

IIIIIIIIIMMIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIMIMM 

GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3960 
GTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4 020 

M MM I MMM II II Ml IMIMMI MM M II MIMM 

GTTTATAG CAAAACATGGGTATGCTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4 020 
AAAAAATTATATATCTGGGAGGATTTTTTGGTTG CCTAAAGTGG CTATAGTTACTGATTT 4 08 0 

I II Mi II M M II MMM IM II II M' II III Ml 

AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 08 0 
TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 

IM II I Ml MM MMMMM || M M II M MM IM II 

TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 
ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 

M II IM Ml II II II IM II MMM MMM II Ml III 

ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 
CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATT 4260 

II MM II II M. II IM IIMMMI MM II II MMM 

CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCA^ 42 60 



Qy 4261 AAAATGCCACATTTCTGGTCTCTGGG 4286

        ||||||||||||||||||||||||||
Db 4261 AAAATGCCACATTTCTGGTCTCTGGG 4286



RESULT 6
S57283
LOCUS       S57283                  4286 bp    mRNA    linear   PRI 18-MAR-2002
DEFINITION  Homo sapiens endothelin ET-B receptor mRNA, complete cds.
ACCESSION   S57283
VERSION     S57283.1  GI:298321
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4286)
  AUTHORS   Arai,H., Nakao,K., Hosoda,K., Ogawa,Y., Nakagawa,O., Komatsu,Y. and
            Imura,H.
  TITLE     Molecular cloning of human endothelin receptors and their
            expression in vascular endothelial cells and smooth muscle cells
  JOURNAL   Jpn. Circ. J. 56 Suppl 5, 1303-1307 (1992)
  MEDLINE   93180293
   PUBMED   1291713
  REMARK    GenBank staff at the National Library of Medicine created this
            entry [NCBI gibbsq 128424] from the original journal article.
            This sequence comes from Fig. 5.
FEATURES             Location/Qualifiers
     source          1..4286
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     CDS             238..1566
                     /gene="endothelin ET-B receptor, ET-BR"
                     /note="This sequence comes from Fig. 5; ET-BR"
                     /codon_start=1
                     /product="endothelin ET-B receptor"
                     /protein_id="AAB25531.1"
                     /db_xref="GI:298322"
                     /translation="MQPPPSLCGRALVALVLACGLSRIWGEERGFPPDRATPLLQTAE
                     IMTPPTKTLWPKGSNASLARSLAPAEVPKGDRTAGSPPRTISPPPCQGPIEIKETFKY
                     INTVVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIVIDIPINVY
                     KLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWT
                     AVEIVLIWVVSVVLAVPEAIGFDIITMDYKGSYLRICLLHPVQKTAFMQFYKTAKDWW
                     LFSFYFCLPLAITAFFYTLMTCEMLRKKSGMQIALNDHLKQRREVAKTVFCLVLVFAL
                     CWLPLHLSRILKLTLYNQNDPNRCELLSFLLVLDYIGINMASLNSCINPIALYLVSKR
                     FKNCFKSCLCCWCQSFEEKQSLEEKQSCLKFKANDHGYDNFRSSNKYSSS"
BASE COUNT     1327 a    829 c    816 g   1314 t
ORIGIN



Query Match 99.6%; Score 4284.4; DB 9; Length 4286; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4285; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 



Db 



1 GAGACATTCCGGTGGGGGACTCTGGCCAGCCCGAGCAACGTGGATCCTGAGAGCACTCCC 60 



Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 






Qy 


781 


Db 


781 


Qy 


841 


Db 


841 



AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 12 0 

IIIIMMIIIIIII II II !l II M II IIIIIIIIIIMMIIII in 

AGGTAGGCATTTGCCCCGGTGGGACGCCTTGCCAGAGCAGTGTGTGGCAGGCCCCCGTGG 12 0 



IIIIIIIIIIIIIIMIIIIMIIMMIMIIIIIIIIIIMIIMIIIII 



AACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 24 0 

IMIIIIIIIIIIIIIIMI IIIMIIIIMIIIIMIIIIIIIIIIIMIMIIIIII 

AACTTGGCTCTGAAACTGCGCAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATG 24 0 
CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIII 

CAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTG 300 
TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360 

I ! I ! 1 1 1 1 1 1 1 1 1 1 1 1 1: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I M 

TCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAA 360 
ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 42 0 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIMIIIIIMIIIIIIIIII 

ACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGT 42 0 
CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 4 80 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIMIMM 

CTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCG 4 8 0 
CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 54 0 

IIIIMMIIMIIIIIIMIIIIIIIIIIIIIMIIIIIIMIIIIIMIIIMIIIM 

CCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAA 54 0 
TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 

I I I M I I II MMI M II II M II II IMIMI M M 

TACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACA 600 
CTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCC 660 

IIIIIIMIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIMIIIMIMM 

CTTCTGAGAATTATCTACAAGAACAAGTG CATGCGAAACGGT CC CAATATCTTGATCGC C 660 
AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720 

MM, I M I MM MMI M II M II M II MMI MM, I 

AGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAG 720 



M I M M IMM IMI II MMI IIMIMIMI M MM I 

:tgctggcagaggactggccatttggagctgagatgtgtaagctggtgcctttcataca( 

aagcctccgtgggaatcactgtgctgagtctatgtgctctgagtattgacagatatcgi 

IIIMMIIMIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIMMIIIIIMI 

AAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGJ 
CTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAJ 

I II I II M I I Ml MM MMIMIII! M MMI I 



Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 



ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 

M> I M' ll I II M M MMI Ml M Ml M III M II 

ATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGAT 960 
ATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 102 0 

MM II Ml I III I II II M II I I! II MIMMII 

ATAATTACX3ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAG 1020 
AAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080 

MIMIIIIIMIIIIIIIIIIIIIIIIIMIIIMIIIIIIMIIIIIIIIIIIIIIII 

AAGACAG CTTTCATG CAGTTTTACAAGACAG CAAAAGATTGGTGGCTGTTCAGTTTCTAT 1080 
TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 

IIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIMI 

TTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTG 114 0 
AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 1200 

IMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMMIIIIMI 

AGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTG 12 0 0 
GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 

IIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIMIIM 

GCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTC 1260 
AGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 132 0 

M Mill II M II I II I! II MMM II MMMMI | 

AGCAGGATTCTGAAG CT CACT CTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTG 132 0 
AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 138 0 

I II II II M MM M IMMMM II I Ml II IM IMM 

AGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATT 1380 
AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 

M MMI IM M II M II II I MMM ! IMMMM Ml 

AACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTA 144 0 
TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 

I II I Ml II MM I II MUM II I II IMM III I 

TGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTA 1500 
AAGTTC^AAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 

II Ml I MM I II M IM II M II MMM II I MMM 

AAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCA 1560 
TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 162 0 

M II MMI II MM I M MMM MMM II MMM II I 

TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAA 1620 
AACAAAATGAAACATTTGCCAAAACAAAAC 168 0 

I IMM II M'MI M Ml Mi I MMM IMMM I Ml 

AACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGC^ 168 0 

TAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATTTTATGAG CTGT 174 0 

M II MMIMMMI MMM III MMliMIMIMIIMMMI I 

TAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGT 174 0 



Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 



TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 1800 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1! 1 1 1 1 1 1 1 II II 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 II II 

TTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAAT 18 00 
TTTTTAC^GTTAGCACTTC^CATAGCTCTTAAC^CTTCCAGGATATTC^C^CAACACT 1860 

MMIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIII 

TTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCA 1860 

TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
TAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTA 192 0 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 

IIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIMIIIIIIIIMI 

AATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATG 1980 
AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 

I II M IM I II I II M MMM I Mill MMI M 

AAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATAT 2 04 0 
TATCACACTATTAT CAGATTGTAATTAGATG CAAATGAGAGAGCAGTTTAGTTGTTG CAT 2100 

IIIMIMIMIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIMIIIIIIIIMI 

TATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCAT 2100 
TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 

IIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIII 

TTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGT 2160 
TTTTGAAAATCATTACACTTTCACTAGAAGCCC^AACCTCAGCATTCTGCAATATGTAAC 222 0 

Ml MM M II I Ml M III I ill II Ml II M M MMI 

TTTTGAAAATCATTAC^CTTTCACTAGAAGCCC^AACCTCAGCATTCTGCAATATGTAAC 222 0 
CAACATGTCACAAACAAG CAG CATGTAACAGACTGG CACATGTGCCAG CTGAATTTAAAA 228 0 

IIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 

CAACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAA 2280 
TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 

I Ml II M IM I Ml M MMM II III IMIMM IM I I 

TATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCA 234 0 
CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 

MMM I II I MMIMM M II II IMIl M III MMM 

CAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCA 24 00 



2460 



CATACCCT 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 

IIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

TCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACA 2 52 0 
CTGCATGTAGATGATTAAATGAGGGCAGGC CCTGTG CT CATAG CTTTACGATGGAGAGAT 258 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

CTG CATGTAGATGATTAAATGAGGG CAGG CCCTGTG CTCATAG CTTTACGATGGAGAGAT 2580 
2581 GCC^GTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTG 2 64 0 



Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 



IE I II 1 1 ; 1 1 ll'il ill ti II. til I ^ 1 1 ! i l j 

GCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGG 2 64 0 
GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIMIIIIIIIIIMIIIIIIMIII 

GCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAAT 2700 
GCTATAGTTAAAATACTATTTTTC^AAATCATACAGATTAGTACATTTAACAGCTACCTG 2760 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAG CTAC CTG 2760 
TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 2820 

II M II Mill M IM II M II II Ml Mi ll MMI III 

TAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTG 282 0 
ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 2880 

I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I 

ACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTT 288 0 
AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 

I I IM MMI MM I Ml II III Ml M III M III I MM 

AGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTT 2 94 0 
GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3000 

II II II MUM Ml I Ml M Ml II Ml IMMI III I Ml 

GGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTG 3 000 
GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3060 

I MMI Ml II II I II I II II M' II M; II IM II 

GGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCT 3 060 
CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCC^^ 312 0 

M I III II III II Ml II II III Ml I MM I III Ml I 

CGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAG 312 0 



M IM MMI Ml MMM I IM II I MM I II I I 



ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 

IIIIIMIIIMIIIIIIIIIIIIMMIIIIMIIIIIIIIIIIIIIIIIIIIIIMII 

ATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATG 324 0 
TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCC^CAATGTGGC^ 3300 

IIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIMIIIIIIIIMIIIIMMIIIIMI 

TTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCA 3300 
GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 

I M 1 1 !! 1 1 1 M I ! 1 1 1 ! 1 1 I ! 1 1 1 1 M , !l 1 1 1 1 1 1 , 1 [ 1 1 1 1 1 1'l 1 1 1 1 1 1 N I ! 

GAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCC 3360 
ACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGT 342 0 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIII 

AO^CTTGTTCTTTAATTTCATCCC^TCACTTTTTCAGAGGCCTGTTATC^TAGAAGT 342 0 
CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 34 8 0 

IIIMIIIMIIIMIIIIIIIIMIMIIMIMIIIIIIIIIIIIIIIIIIIIMIII 



Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 


Db 


4021 


Qy 


4081 


Db 


4081 


Qy 


4141 


Db 


4141 






Db 


4201 


Qy 


4261 


Db 


4261 



3421 CATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAA 34 8 0 
TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 

1 1 i : 1 1 1 I ; 1 1 1 , 1 1 I II' ; I ! I 

TATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACA 354 0 
TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 

IIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 

TCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTT 3600 
TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 

I II 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 II I II' 1 1 1 1 1 1 1 1 1 1 1 ni! II 1 1 1 1 1 1 M 

TGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGGCTT 3660 
TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720 

I II I II II MM! M M Ml II II M MIMI I 

TTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAAT 3720 

TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 3780 

I I I I I I I M I I I I I I I I I I II I II II I I II II I I I I I I I I I ll I I I II III I I II 'I 
TTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGT 378 0 

TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAAC1ATGGATGTTA 384 0 

IMMIMMMIII llllllllllllllllllllllllllllllllllllllllllll 

TTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATGTTA 384 0 
CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 

CAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAAT 3 900 
GTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 

1 1 ! i I : ! . i i I : , - - : i : ! I ! i . I 

GTGGATGTATGTTC^ACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAATACA 3 960 
GTTTATAGCAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4 020 

I: MM: II II Mill Ml I IN Ml ME! I MINI II 

GTTTATAGCAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAATGT 4 02 0 
AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 08 0 

II II M II IM I III Ml I Ml Ml MUM II Ml M 

AAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTT 4 08 0 
TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 

M MM I MIMM M M M il II M Ml MMI Ml 

TTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTC 414 0 
ACTGTACAGAC^CTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 4 2 00 

II II Ml II II I ill Ml M IM III II I MM III II 

ACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGA 42 00 
CAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTA 4260 

IMMM I MIIMMII II IM IMIIMM MIMI II III Ml 

CAAGTGGACATTATTTATGTTAAATATACAATTATC^GCAAGTATGAAGTTATTCAATT 4260 

AAAATGCCACATTTCTGGTCTCTGGG 4286 

||||||||||||||||||||||||||
AAAATGCCACATTTCTGGTCTCTGGG 4286 
D13162S7 2972 bp DNA linear PRI 12-OCT-2002

Homo sapiens hET-BR gene for endothelin-B receptor, complete cds
and exon 7.
D13168

D13168.1 GI:285924
7 of 7

Homo sapiens (human)
Homo sapiens

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

1 (bases 1 to 2972)

Arai,H., Nakao,K., Takaya,K., Hosoda,K., Ogawa,Y., Nakanishi,S. and
Imura,H.

The human endothelin-B receptor gene. Structural organization and
chromosomal assignment

J. Biol. Chem. 268 (5), 3463-3470 (1993)

93155196

8429023

2 (bases 1 to 2972)
Arai,H.

Direct Submission

Submitted (02-SEP-1992) Hiroshi Arai, Kyoto University School of
Medicine, Second Division, Department of Medicine; 54 Shogoin,
Kawahara-cho, Sakyo-ku, Kyoto, Kyoto 606, Japan
(Tel: 81-75-751-3170, Fax: 81-75-771-9452)

Location/Qualifiers

1..2972

/organism="Homo sapiens"
/mol_type="genomic DNA"
/db_xref="taxon:9606"
/chromosome="13"

/clone_lib="human genomic libraries"

join(D13162.1:1002..1742,D13163.1:11..123,

D13164.1:11..215,D13165.1:11..160,D13166.1:11..144,

D13167.1:11..119,11..2865)

/gene="hET-BR"

join(D13162.1:1260..1742,D13163.1:11..123,

D13164.1:11..215,D13165.1:11..160,D13166.1:11..144,

D13167.1:11..119,11..145)

/gene="hET-BR"

/note="G protein-coupled receptor"
/codon_start=1

/product="endothelin-B receptor"
/protein_id="BAA02445.1"
/db_xref="GI:285926"

/translation="MQPPPSLCGRALVALVLACGLSRIWGEERGFPPDRATPLLQTAE
IMTPPTKTLWPKGSNASLARSLAPAEVPKGDRTAGSPPRTISPPPCQGPIEIKETFKY
INTVVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIVIDIPINVY
KLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWT
AVEIVLIWVVSVVLAVPEAIGFDIITMDYKGSYLRICLLHPVQKTAFMQFYKTAKDWW
LFSFYFCLPLAITAFFYTLMTCEMLRKKSGMQIALNDHLKQRREVAKTVFCLVLVFAL
CWLPLHLSRILKLTLYNQNDPNRCELLSFLLVLDYIGINMASLNSCINPIALYLVSKR
FKNCFKSCLCCWCQSFEEKQSLEEKQSCLKFKANDHGYDNFRSSNKYSSS"
exon 11..2865

/gene="hET-BR"

/product="endothelin-B receptor"
/note="G protein-coupled receptor"
/number=7

/evidence=experimental
BASE COUNT 1018 a 499 c 465 g 990 t

ORIGIN 



Query Match 66.4%; Score 2857; DB 9; Length 2972; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2857; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 1489 

II II II MM! Mi li II MINI MM II IN li II IN I 

AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 6 8 
AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 154 9 

MMMMMMMIMMMMMMMMMMMMMMMMMMMMIMM 

AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 128 
AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 1609 

II Mill Ml II I MIMI I MMI IIIIIMI Mill MM 

AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 188 
GAAGTCATTAAAAGAAAATGAA^ 1669 

Mill I Mill MMIIMM I II IN M IMM M Ml III I 

GAAGTCATTAAAACAAAATGAAACATTTGC(^^ 248 
CAGCACACTATTAAAATATTAAGTGTAAT^ 172 9 

I II Mill Mill II I II Mill III II MMI III II III Mil I II III II III II II 

CAGCACACTATTAAAATATTAAGTGTAATTATT^ 308 
TTATGAGCTGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGA 1789 

Ml IMM Ml II I Mill I IMM MMI M II Ml II M 

TTATGAGCTGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGA 368 
AAGCACTTAATTTTTTACAGTTAGCACT^ 184 9 

Mill I Mill IM II IMM II M; I III I I IIMM I 

AAGCACTTAATTTTTTACAGTTAGCACTTC^CATAGCTCTTAAaVACTTCCAGGATATT 428 

OVCACAAC^CTTAGGCTTAAAAATGAGCTC^CTCAGAATTTCTATTCTTTCTAAAAAGAG 1909 

I I I I I I I M II I II I I II II I I II I I I I II I II II I I I I I I I I I I I II I II I I I I I I I M 
C^CAC^CACTTAGGCTTAAAAATGAGCTCACTC^GAATTTCTATTCTTTCTAAAAAGAG 488 

ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 1969 

MIMMM Mill M III Ml I MMMMNIMM M III I I 

ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 54 8 
ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 2029 

Ml IMM Ml II I MIMI MMM MMMI MMI II I 

ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 608 
TCAATTAATATTATCA.CACTATT^ 2089 

Ml III IMM II III II Mill III MIMMM I II Mill II Mill Ml II Ml II 

TCAATTAATATTATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTT 668 
2630
1209


Qy 


2690 


Db 


1269 


Qy 


2750 


Db 


1329 


Qy 


2810 


Db 


1389 


Qy 


2870 


Db 


1449 



AGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAG 214 9 

IIIIIIIIIIIIIMIMIIIIIIMIIIIMIIIIIIMIIIIIIIIIIMIIIIIIII 

AGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAG 72 8 
AGCAAGGCTGTTTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTG 22 09 

IIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIMMMIIIIIMIIIIIIIIIIIIII 

AGCAAGGCTGTTTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTG 788 
CAATATGTAACCAACATGTCACAAACAAGC^ 22 69 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIMMIIIIIIMIIIIIIIIIIM 

CAATATGTAACC^CATGTCACAAACAAGCAGCATGTAAC^GACTGGCACATGTGCCAGC 848 
TGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAG 232 9 

Mill lllillllMIIIIIIIIMMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMI 

TGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAG 908 
ATCAAACCTCAOU^GAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTG 238 9 

IIIIIIIIIIIIMIIIIMIMIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIII 

ATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAA 968 
TCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAA 244 9 

1 1 1 1 1 III 1 1 1 II 1 1 I II I II I II 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1! 1 1 II I II II 1 1 1 II 

TCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAA 1028 
ATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTT 2 5 09 

I M II Mill Mill II I! MMM II Mill MMM M I 

ATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTT 108 8 
ACCTACATACACTG CATGTAGATGATTAAATGAGGGCAGG CC CTGTG CTCATAGCTTTAC 2569 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 

ACCTACATACACTG CATGTAGATGATTAAATGAGGGCAGGCCCTGTG CTCATAGCTTTAC 114 8 



M II M IMM MM I II IM II II Mi IMM II M I 



ATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGC 2 68 9 

MIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

ATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGC 1268 
ATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTA 2 74 9 

I II M II IMM I II IM II II M> III II II M I 

ATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTA 1328 
ACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAA 280 9 

M II M II MM! II II MMM IMMI III II IM II I 

ACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAA 1388 
AAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGT 2869 

M I MMM Ml M II MMMM II Ml I Ml I MMM III 

AAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGT 1448 
AAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCT 2 92 9 

II II II II Mill M II II IMM IM II II II I I 

AAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCT 1508 
TAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAG 2989

MIIIIIIIIIIMIIIIIIIIIMIIIIIIIIMIIIIIIIMIIIIIMIIIIIIIII 

TAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAG 1568 
GAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTT 304 9 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIII 

GAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTT 1628 
CGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTG 3109 

IM IM I II II Mill M ! MI MM III I MM II I! 

CGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTG 1688 
CAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAA 3169 

II Ml M III II I II M IM I II il M II II I 

CAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAA 1748 
AATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATA 322 9 

M M Mill IMM II IMMI IM MMM I III! IM IM 

AATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATA 18 08 
AGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAA 3289 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

AGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAA 1868 
CAATGTGG CCAGAAAGAAAGAG CAATAATAATTAATTCACACACCATATGGATTCTATTT 334 9 

IIIIIIIIIMIIIMIIIIIIIIIMIMIIMIIIMMIIIIIIIIIIIIIIIIIII 

CAATGTGG CCAGAAAGAAAGAG CAATAATAATTAATTCACACAC CATATGGATTCTATTT 1928 
ATAAATCACCCAOW^CTTGTTCTTTAATTTC^TCCCAATCACTTTTTCAGAGGCCTGT^ 34 09 

II Ml MIMI II I :M II IM I IM IMM Ml IM IM 

ATAAATCACCCACAAACTTGTTCTTTAATTTCATC 1988 
ATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCA 34 69 

Ml Ml I Ml IMM Ml II MIMM IMIIMM III III I! 

ATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCA 2 048 
CAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACT 3529 

M Ml IM III II MMI IM II I Ml IMM II MMM 

CAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACT 2108 
GAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATC 3589 

Ml IM I Ml MMM M II II !M II Ml I MM IM M 

GAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATC 2168 
TTGCCAAATTTTGAAACTACACACAAAAAGCMACT 364 9 

IIIMIMIIIIIIIIIIIIIIIIMIIIIMIIMIIIIIIIIIIIIIIIIIMIIIII 

TTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAA 2228 

TTCAGTGGCTTTTTAAAAAAAATGTTTGATTC^Uy^CTTTAACATACTGATAAGTAAGAA 37 09 

MIMIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIII 
TTCAGTGGCTTTTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAA 2288 

ACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTA 3769 

IIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 

ACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACT 234 8 
377 0 TCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGC^^ 3829 
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II M, II IMM IM II II IIMI II II II I III MMM 

TCAAAACATGTTT C CTAGTATTAAGGACTTTAATATAG CAACAGACAAAATTATTGTTAA 24 08 
CATGGATGTTACAG CTCAAAAGATTTATAAAAGATTTTAACCTATTTT CTCCCTTATTAT 3889 

M M IM IMM MM I II Mill II M II I Ml MMM 

CATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTAT 2468 



1 1 J 1 1 1 1 1 1 i 1 1 1 1 ! 1 1 1 1 1 i 1 1 1 J I j I J 1 1 1 1 1 j j 1 1 1 1 1 ! 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 



AAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAA 4 009 

M II II II II II I MMM IMMM I III I MM MMM 

AAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAA 2588 
TATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTG CCTAAAGTGG CTATA 4 069 

I Ml II Ml I MM M IM II IMMM I MMM MMMMM 

TATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATA 2 648 
GTTACTGATTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTA 412 9 

II MMM II M II I Ml MMM II MMI IMM II II 

GTTACTGATTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTA 2708 
CCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAA 4189 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIII 

CCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAA 2768 
TATAAATGTGAC^GTGGAC^TTATTTATGTTAAATATAC^TTATCAAGC^GTATGAA 424 9 

IIIMIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMII 

TATAAATGTGACAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAA 282 8 
GTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 4286 

IM MMM I! MMM I II Ml I M 

GTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 2865 
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AL139002 183337 bp DNA linear PRI 28-JAN-2001 

Human DNA sequence from clone RP11-318G21 on chromosome 
13q22.2-31.1, complete sequence.
AL139002 

AL139002.18 GI:12597038
HTG.

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 183337) 
Wall,M. 

Direct Submission 

Submitted (28-JAN-2001) Sanger Centre, Hinxton, Cambridgeshire, 
CB10 1SA, UK. E-mail enquiries: humquery@sanger.ac.uk Clone
requests: clonerequest@sanger.ac.uk

On Jan 29, 2001 this sequence version replaced gi:12584355.
During sequence assembly data is compared from overlapping clones. 
Where differences are found these are annotated as variations
together with a note of the overlapping clone name. Note that the
variation annotation may not be found in the sequence submission 
corresponding to the overlapping clone, as we submit sequences with 
only a small overlap as described above. 

This sequence has been finished according to sequence map criteria 
as follows. An attempt is made to resolve all sequencing problems, 
such as compressions and repeats, but not necessarily within known 
annotated repeat sequence elements. Where the sequence is 
ambiguous, there is an annotation using the 'unsure' feature key.
The following abbreviations are used to associate primary accession 
numbers given in the feature table with their source databases: 
Em:, EMBL; Sw:, SWISSPROT; Tr:, TREMBL; Wp:, WORMPEP; Information
on the WORMPEP database can be found at
http://www.sanger.ac.uk/Projects/C_elegans/wormpep This sequence
was generated from part of bacterial clone contigs of human 
chromosome 13, constructed by the Sanger Centre Chromosome 13 
Mapping Group. Further information can be found at 
http://www.sanger.ac.uk/HGP/Chr13

RP11-318G21 is from the library RPCI-11.2 constructed by the group 
of Pieter de Jong. For further details see 
http://www.chori.org/bacpac/home.htm
VECTOR: pBACe3.6

This sequence is the entire insert of clone RP11-318G21. The true
left end of clone RP11-267I18 is at 125528 in this sequence. 
FEATURES             Location/Qualifiers
     source          1..183337
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /chromosome="13"
                     /map="q22.2-31.1"
                     /clone="RP11-318G21"
                     /clone_lib="RPCI-11.2"
     repeat_region   3..454
                     /note="L1MC5 repeat: matches 7127..7575 of consensus"
     repeat_region   1216..1308
                     /note="HAL1 repeat: matches 1475..1563 of consensus"
     repeat_region   1309..1597
                     /note="AluJb repeat: matches 1..297 of consensus"
     repeat_region   1598..2044
                     /note="HAL1 repeat: matches 1003..1475 of consensus"
     repeat_region   2148..2276
                     /note="L2 repeat: matches 2620..2749 of consensus"
     repeat_region   2330..2378
                     /note="L2 repeat: matches 2442..2492 of consensus"
     repeat_region   3915..4224
                     /note="AluY repeat: matches 1..306 of consensus"
     repeat_region   4617..4750
                     /note="67 copies 2 mer cc 61% conserved"
     repeat_region   4648..4727
                     /note="20 copies 4 mer cctt 78% conserved"
     repeat_region   4729..4784
                     /note="14 copies 4 mer tcct 78% conserved"
     repeat_region   5431..5736
                     /note="AluSx repeat: matches 1..305 of consensus"
     repeat_region   11990..12273
                     /note="AluSx repeat: matches 9..292 of consensus"
     repeat_region   12589..12809
                     /note="MIR repeat: matches 7..234 of consensus"
     repeat_region   13390..13519
                     /note="L2 repeat: matches 2410..2548 of consensus"
     repeat_region   14630..14978
                     /note="THE1B repeat: matches 1..360 of consensus"
     repeat_region   15092..15580
                     /note="L1MB1 repeat: matches 5656..6116 of consensus"
     repeat_region   15581..16095
                     /note="L1PA7 repeat: matches 5629..6143 of consensus"
     repeat_region   16096..16549
                     /note="L1MB1 repeat: matches 5188..5656 of consensus"
     repeat_region   16731..16777
                     /note="MIR repeat: matches 35..78 of consensus"
     repeat_region   16778..17137
                     /note="THE1C repeat: matches 1..371 of consensus"
     repeat_region   17138..17273
                     /note="MIR repeat: matches 78..226 of consensus"
     repeat_region   17374..17484
                     /note="MIR repeat: matches 26..158 of consensus"
     repeat_region   17485..17777
                     /note="AluSc repeat: matches 1..290 of consensus"
     repeat_region   17778..17815
                     /note="MIR repeat: matches 158..191 of consensus"
     repeat_region   18981..19048
                     /note="34 copies 2 mer tt 66% conserved"
     repeat_region   19447..19589
                     /note="MIR repeat: matches 131..262 of consensus"
     repeat_region   19843..20162
                     /note="MER33 repeat: matches 1..324 of consensus"
     repeat_region   20866..21198
                     /note="MER44A repeat: matches 3..333 of consensus"
     repeat_region   21742..21878
                     /note="MIR repeat: matches 9..154 of consensus"
     repeat_region   22214..22310
                     /note="MIR repeat: matches 164..260 of consensus"
     repeat_region   22321..22418
                     /note="L1MB8 repeat: matches 6078..6171 of consensus"
     misc_feature    22390..22715
                     /note="Sequence from AC018674 sequenced by WUGSC."
     repeat_region   22419..22730
                     /note="AluY repeat: matches 1..311 of consensus"
     repeat_region   22731..23714
                     /note="L1MB8 repeat: matches 5130..6078 of consensus"
     repeat_region   23715..24008
                     /note="AluSg repeat: matches 1..294 of consensus"
     repeat_region   24009..24264
                     /note="L1MB8 repeat: matches 4884..5130 of consensus"
     repeat_region   24265..24569
                     /note="AluY repeat: matches 1..305 of consensus"
     repeat_region   24570..25577
                     /note="L1MB8 repeat: matches 3786..4884 of consensus"
     repeat_region   25582..25635
                     /note="27 copies 2 mer tt 70% conserved"
     repeat_region   26221..26571
                     /note="MSTA repeat: matches 1..347 of consensus"
     repeat_region   26572..26750
                     /note="AluY repeat: matches 129..307 of consensus"


     repeat_region   26752..27066
                     /note="AluY repeat: matches 1..311 of consensus"
     repeat_region   27067..27134
                     /note="MSTA repeat: matches 347..371 of consensus"
     repeat_region   27963..28006
                     /note="22 copies 2 mer tt 75% conserved"
     repeat_region   28916..29282
                     /note="MER39 repeat: matches 13..381 of consensus"
     repeat_region   29282..29517
                     /note="MER39b repeat: matches 327..579 of consensus"
     repeat_region   30210..30526
                     /note="AluJo repeat: matches 1..303 of consensus"
     repeat_region   31423..31572
                     /note="L1PA13 repeat: matches 6005..6155 of consensus"
     repeat_region   31587..31624
                     /note="19 copies 2 mer tt 86% conserved"
     repeat_region   32103..32181
                     /note="ORSL repeat: matches 390..467 of consensus"
     repeat_region   33878..34312
                     /note="MER57A repeat: matches 1..433 of consensus"
     repeat_region   36673..36768
                     /note="LTR37A repeat: matches 81..172 of consensus"
     repeat_region   36769..37066
                     /note="AluSq repeat: matches 1..296 of consensus"
     repeat_region   37067..37300
                     /note="LTR37A repeat: matches 172..424 of consensus"
     repeat_region   39470..39501
                     /note="16 copies 2 mer tt 90% conserved"
     repeat_region   41434..42607
                     /note="L1M4 repeat: matches -258..888 of consensus"
     repeat_region   42744..43220
                     /note="L1M4 repeat: matches 1085..1580 of consensus"
     repeat_region   43703..44007
                     /note="AluJb repeat: matches 1..305 of consensus"
     repeat_region   44019..44180
                     /note="L1MD1 repeat: matches 6044..6211 of consensus"
     repeat_region   44183..44485
                     /note="AluJo repeat: matches 1..300 of consensus"
     repeat_region   44486..44611
                     /note="L1MD2 repeat: matches 5949..6066 of consensus"
     repeat_region   45256..45430
                     /note="MER5B repeat: matches 1..178 of consensus"
     repeat_region   45588..45669
                     /note="MER5A repeat: matches 109..188 of consensus"
     repeat_region   46919..47084
                     /note="MIR repeat: matches 94..260 of consensus"
     repeat_region   47873..47902
                     /note="15 copies 2 mer tg 90% conserved"
     repeat_region   49906..49933
                     /note="7 copies 4 mer tgtg 96% conserved"
     repeat_region   50452..50507
                     /note="LTR37A repeat: matches 128..184 of consensus"
     repeat_region   51786..51829
                     /note="MER74A repeat: matches 271..309 of consensus"
     repeat_region   51830..52172
                     /note="THE1B repeat: matches 1..364 of consensus"
     repeat_region   52173..52221
                     /note="MER74A repeat: matches 221..271 of consensus"
     repeat_region   52710..53396
                     /note="L1MB7 repeat: matches 5451..6171 of consensus"
     repeat_region   53890..53996
                     /note="MIR repeat: matches 35..152 of consensus"
     repeat_region   54259..54296
                     /note="19 copies 2 mer tt 84% conserved"
     repeat_region   54299..55519
                     /note="L1PA3 repeat: matches 4925..6146 of consensus"

Query Match 66.1%; Score 2841.8; DB 9; Length 183337; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2854; Conservative 0; Mismatches 2; Indels 1; Gaps 1; 
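
The summary figures above can be cross-checked with a little arithmetic. The sketch below is an illustration, not the search program's own code. It assumes that Best Local Similarity is the match count divided by the number of aligned columns (matches + mismatches + indels), and that the score is one point per match minus a mismatch penalty and an affine gap cost. The gap-open 10.0 and gap-extend 1.0 values are taken from the Gapop/Gapext line in the run header earlier in this report, assuming the same settings apply to this run. The 0.6 mismatch penalty is an inferred value that happens to reproduce the reported score; it is not stated anywhere in the report. The function names are hypothetical.

```python
def local_similarity(matches, mismatches, indels):
    """Percent identity over aligned columns (assumed definition)."""
    columns = matches + mismatches + indels
    return 100.0 * matches / columns


def alignment_score(matches, mismatches, indels, gaps,
                    mismatch_penalty=0.6,  # assumption: inferred, not in the report
                    gap_open=10.0,         # from the Gapop value in the run header
                    gap_extend=1.0):       # from the Gapext value in the run header
    """Identity score minus mismatch and affine gap penalties (assumed model)."""
    return (matches
            - mismatch_penalty * mismatches
            - gap_open * gaps
            - gap_extend * indels)


# Counts copied from the "Matches ...; Mismatches ...; Indels ...; Gaps ..." line.
similarity = round(local_similarity(2854, 2, 1), 1)
score = round(alignment_score(2854, 2, 1, 1), 1)
print(similarity, score)
```

Under these assumptions the two printed values agree with the reported Best Local Similarity and Score for this hit. The same model also reproduces the figures for the AC144750 hit (2838 matches, 16 mismatches, 6 indels, 3 gaps), which is some evidence that the inferred mismatch penalty is reasonable.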



Qy       1430 AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 1489
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      72830 AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 72771

Qy       1490 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 1549
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      72770 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 72711

Qy       1550 AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 1609
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      72710 AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 72651

Qy 1610 GAAGTCATTAAAACAAAATGAAACATTTGCCAAAACAA 1669 

IMM MIMIIMIMM ill IM I IM II MMIM II MMM 

Db 72650 GAAGT CATTAAAA CAAAATGAAACATTTG C CAAAACAAAACAAAAAA CTATGTATTTG CA 72591 

Qy 1670 CAGCACACTATTAAAATATTAAGTGTAATTATTTTAACACTC^ 172 9 

MMIIIIIIIIMIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIMIIIMIIIIMI 

Db 72590 CAG CACACTATTAAAATATTAAGTGTAATTATTTTAACACT CACAG CTACATATGACATT 72531 

Qy 1730 TTATGAGCTGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGA 1789 

MMMMMMMIMMMIMMMMMMMMMIMMIMMMMMIMI 

Db 72530 TTATGAGCTGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGA 72471 

Qy 1790 AAGCACTTAATTTTTTAC^GTTAGCACTTCAACATAGCTCTTAACAA 1849 

IMMMMMIMIMM Ml III I II II MMIM II ill M 

Db 72470 AAGCACTTAATTTTTTACAGTTAGCACTTC^CATAGCTCTTAACAACTTCCAGGATA 72411 

Qy 1850 CACAC^CACTTAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAG 1909 

IIMMMIIIIIMIIill ill Mi I II IMMIMM II II M 

Db 72410 CACACAACACTTAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAG 72351 

Qy 1910 ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 1969 

I M Ml III Ml II Ml II I II II MIMIi II Ml M 

Db 7235 0 ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 72291 

Qy 197 0 ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 2029 

II ! II IM II IM IM II M Ml III II I Ml IMM 

Db 722 90 ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAA(^ACTTT 72231 

Qy 2030 TCAATTAATATTATCACACTATTATCAGATTGTAATTAGATG CAAATGAGAGAG CAGTTT 2 08 9 

I IIIIIIMM MIMI III Ml M II MMIIMI: IM II I 



Db 


72230 


Qy 


2090 


Db 


72170 


Qy 


2150 


Db 


72110 


Qy 


2210 


Db 


72050 


Qy 


2270 


Db 


71990 


Qy 


2330 


Db 


71930 


Qy 


2390 


Db 


71870 


Qy 


2450 


Db 


71810 


Qy 


2510 


Db 


71750 


Qy 


2570 


Db 


71690 


Qy 


2630 


Db 


71630 


Qy 


2690 


Db 


71570 


Qy 


2750 


Db 


71510 


Qy


2810 


Db 


71450 


Qy 


2870 


Db 


71390 



72230 TCAATTAATATTATCA(^CTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTT 72171 

AGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAG 214 9 

M II I II IMIMI II I II MM II MMM MM 

AGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAG 72111 

AGCAAGGCTGTTTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTCTG 2209 

MM MM MM I Ml MMI MMM II I MM II 

AGCAAGGCTGTTTTTGAAAATCATTACACTTTC^CTAGAAGCCCAAACCTCAGCATTCTG 72 051 

CAATATGTAACCAACATGTCACAAACAAGC^ 2269 

IIIMMMIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CAAT ATG TAA C CAA CATG T CA CAAA CAAG CAG CATGTAA CAGA CTGG CA CATGTG C CAG C 71991 

TGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAG 232 9 

II' II II I 

TGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAG 71931 



1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II II 1 1 1 II I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



71871 



TCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAA 244 9 

MM MM MM IM II MMM II I Ml IMIMMMMM 

TCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAA 71811 

ATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTT 25 09 

llllllll MIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIMIIIIMII 

ATCTTCTTCTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTT 71751 

ACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTAC 2569 

I MM MMIMI II MM MM II MMI II III III 

ACCTACATACACTG CATGTAGATGATTAAATGAGGG CAGGCCCTGTG CTCATAG CTTTAC 71691 

GATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCAC 2629 

' II MM II MMMMIMM IIMIIIIMIMIIMI III II 

GATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCAC 71631 

ATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGC 2689 

M M MMI MM MMI Ml I II I! II III Ml 

ATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGC 71571 

ATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTA 274 9 

I MMI I! MM II I MMMMMMMI MMIMI MMM 

ATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTA 71511 

ACAG CTACCTGTAAAG CTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAA 2809 

Ml MM IMMMMI II MMI IMM MMIM Ml II 

ACAG CTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAG CCAATAGAA 714 51 

AAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGT 2869 

'Ml IMMMMMI II MIMI III II II II II MMM 

AAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGT 713 91 

AAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCT 2 929 

Mill || II 1 1 II II II MMI II 

AAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCT 71331 



Qy 


2930 


Db 


71330 


Qy 


2990 


Db 


71270 


Qy 


3050 


Db 


71210 


Qy 


3110 


Db 


71150 


Qy 


3170 


Db 


71090 


Qy 


3230 


Db 


71030 


Qy 


3290 


Db 


70970 


Qy 


3350 


Db 


70910 


Qy 


3410 


Db 


70850 


Qy 


3470 


Db 


70790 


Qy 


3530 


Db 


70730 


Qy 


3590 


Db 


70670 


Qy 


3650 


Db 


70610 


Qy 


3710 


Db 


70551 



TAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAG 2 989 

MMMIMMIMMMMMMMMIMIMMMMMIMMMIMIMIMM 
TAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAG 71271 

GAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTT 304 9 

1 1 i 1 1 1 1 I I I I I I I I II Ml I , II I I II : 1 1 1 Ml 1 1 1 1 1 1 1 i 1 1 1 Mi 1 1 1 1 1 II I 

GAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTT 71211 
CGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTG 3109 

II II III II Ml 1 1 II I Mill II MM Mill MM II I II I II II III II Mill III 

CGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTG 71151 
CAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAA 3169 

MMMMMMMMMMMMMMMMMMMMIMMMMIMMIIMM 

CAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAA 71091 
AATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATA 3229 

I II M II I Ml M II I Ml II, I IMM MMM II 

AATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATA 71031 



I II I I I I I I I II I II I I II I I II I II II II I II II I II I I I II I II I I I I I I I M II I II 



CAATGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTT 334 9 

M ! IM I II II Ml M II I Ml IM I III I Ml MM 

CAATGTGG CCAGAAAGAAAGAG CAATAATAATTAATTCACACAC CATATGGATTCTATTT 70911 
ATAAATCACCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTT 34 09 

1 I MM I M M Ml M II I Ml IM M III ! Ml MM 

ATAAATCACCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTT 70851 
ATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCA 34 69 

II III MM III MM III MM Ml Mlllllll.il Ml Ml INI MM 

ATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCA 707 91 
CAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACT 352 9 

M I III I M I MIMM II I III IMM MM I III MM 

CAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACT 70731 
GAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATC 358 9 

MMMMIIMMMIMM r ! 1 1 1 1 1 1 1 1 1 f 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 r 1 1 f I i 1 1 1 1 1 1 

GAATTTTTACATCCTGATACCTTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATC 70671 
TTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCA 364 9 

I I MMM II IM II I MM I MM! I II M 1 IMM I 

TTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCA 7 0611 
TTCAGTGGCTTTTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAA 3709 

M II M I II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II I II 1 1 II 1 1 1 M 1 1 1 1 1 II 1 1 II 1 1 M 1 1 1 

TTCAGTGGCTTTTT -AAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAA 70552 
ACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACT 3769 

I MMM I MM M II I MM I IMM IMM MMM II M 

ACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACT 7 04 92 



Qy 3770 TCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAA 3829 

I Mill IMM III M MM! M II M I! I MM MIM 

Db 70491 TCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAACAGAC^^ 70432 

Qy 3830 CATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTAT 388 9 

IIIIMIIIIIIIIMIIMIIIIIIIIMIIIIIMIIIIIIIIIIIIIIMIMMII 

Db 70431 CATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTAT 7 0372 

Qy 38 90 CCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAG CTTACATATGG CC 3 94 9 

IIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIMIIIIIIMIIMMIIIII 

Db 70371 CCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAG CTTACATATGG CC 70312 

Qy 3950 AAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAA 4 009 

MM II Ml I III I Ml M III IM I MM ll II MM 

Db 70311 AAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAA 70252 

Qy 4 010 TATAACAATGTAAAAAATTATATAT CTGGGAGGATTTTTTGGTTGC CTAAAGTGG CTATA 4 069 

1 1 1 1 i ! 1 1 1 ! I E 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 E 1 1 1 ! 1 1 E 1 1 1 1 1 j 1 1 1 1 

Db 70251 TATAACAATGTAAAAAATTATATAT CTGGGAGGATTTTTTGGTTGC CTAAAGTGG CTATA 70192 

Qy 4 070 GTTACTGATTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTA 412 9 

M Mill Ml M III M IM IMM Ml II II M M Ml II 

Db 70191 GTTACTGATTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTA 70132 

Qy 413 0 CCTTATTTTTCACTGTACAGACA.CTAATTCATTAAATACTAATTGATTGTTTAAAAGAAA 418 9 

IIIIIIIIMMIMIIMIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

Db 7 0131 CCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAA 7 0072 

Qy 4190 TATAAATGTGAC^GTGGACATTATTTATGTTAAATATAC^TTATCAAGCAAGTATGAA 424 9 

IMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMM 

Db 70071 TATAAATGTGACAAGTGGACATTATTTATGTTAAATATA 70012 

Qy 4250 GTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 4286 

II Mill III I MM I; II Mill M 

Db 70011 GTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 69975 
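
In this alignment the Db coordinates run downward (72830 at Qy 1430, 69975 at Qy 4286), which is what a match against the complementary strand looks like. The sketch below shows one way to convert a query position into a database coordinate for this hit. It is a hand-written illustration under stated assumptions: the function name is hypothetical, and the single gap is placed at query position 3664, read from the row that starts at Qy 3650, where the Db line shows a `-` after 14 bases. Because the gap sits in a run of A residues, the exact column is arbitrary.

```python
def query_to_db(q, q_start=1430, db_start=72830, gap_q=3664):
    """Map a query position to a reverse-strand database coordinate.

    Returns None for the one query base that faces the gap in the
    database line. gap_q is an assumption read from the alignment row.
    """
    if q == gap_q:
        return None
    offset = q - q_start
    if q > gap_q:
        offset -= 1  # one query base consumed no database base
    return db_start - offset  # coordinates decrease on the complement strand


# Row ends and the final position, for comparison with the printed alignment.
for q in (1430, 1489, 3709, 3710, 4286):
    print(q, query_to_db(q))
```

The printed pairs can be compared with the row-end coordinates shown in the alignment, for example 1489 against 72771 and 4286 against 69975.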
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AC144750/C 
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AC144750 201093 bp DNA linear HTG 04-JUN-2003 

Pan troglodytes clone CH251-517B22, WORKING DRAFT SEQUENCE, 3
ordered pieces.
AC144750

AC144750.2 GI:31376422
HTG; HTGS_PHASE2; HTGS_DRAFT.
Pan troglodytes (chimpanzee) 
Pan troglodytes 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan.
REFERENCE 1 (bases 1 to 201093)

AUTHORS Antonellis,A., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Brinkley,C., Brooks,S., Cariaga,K.,
Chu,G., Coleman,B., Coleman,H., Engle,J., Granite,S., Guan,X.,
Gupta,J., Haghighi,P., Han,J., Hansen,N., Ho,S.-L., Hu,P.,
Hurle,B., Idol,J.R., Karlins,E., Kwong,P., Laric,P., Lee-Lin,S.-Q.,
Legaspi,R., Maduro,Q.L., Maduro,V.B., Margulies,E.H., Masiello,C.,
Maskeri,B., McDowell,J., Paguirigan,C., Pearson,R., Portnoy,M.E.,
Prasad,A., Reddix-Dugue,N., Schandler,K., Schueler,M.G., Shah,K.,
Sison,C., Stantripop,S., Thomas,J.W., Thomas,P.J., Tsipouri,V.,
Vogt,J.L., Wetherby,K.D., Wiggins,L., Young,A. and Green,E.D.

TITLE NISC Comparative Sequencing Initiative 

JOURNAL Unpublished
REFERENCE 2 (bases 1 to 201093) 

AUTHORS Green, E.D. 

TITLE Direct Submission 

JOURNAL Submitted (15-MAY-2003) NIH Intramural Sequencing Center, 8717
Grovemont Circle, Gaithersburg, MD 20877, USA 
REFERENCE 3 (bases 1 to 201093) 
AUTHORS Green, E.D. 
TITLE Direct Submission 

JOURNAL Submitted (04-JUN-2003) NIH Intramural Sequencing Center, 8717
Grovemont Circle, Gaithersburg, MD 20877, USA 
COMMENT On Jun 4, 2003 this sequence version replaced gi:30725907.

Genome Center 

Center: NIH Intramural Sequencing Center 
Center code: NISC 

Web site: http://www.nisc.nih.gov 
Contact: nisc_zoo@nhgri.nih.gov 

Project Information 

Center project name: esg 
Center clone name: 517B22 



The sequence data in this record represents an 'enhanced' 
version of a Phase 2 submission. Specifically, the indicated 
order and orientation of each sequence contig has been 
established using one or more of the following: read-pair 
data from individual subclones, overlaps with neighboring 
clones, alignment with available reference sequence (e.g., 
human), and/or confirmation by PCR testing. In addition, 
the sequence assembly is based on at least 8X average 
coverage in Q20 bases and has been reviewed to rule out
gross misassemblies, the low-quality ends of sequence 
contigs have been trimmed away, and each base is associated 
with a Phrap-derived quality score. 

Summary Statistics 

Sequencing vector: plasmid; n/a; 100% of reads 
Chemistry: Dye-terminator Big Dye; 100% of reads
Assembly program: Phrap; version 0.990319 
Consensus quality: 200649 bases at least Q40 
Consensus quality: 200775 bases at least Q30 
Consensus quality: 200836 bases at least Q20 
Insert size: 165000; agarose-fp 
Insert size: 200893; sum-of-contigs
Quality coverage: 13.47x in Q20 bases; agarose-fp
Quality coverage: 11.07x in Q20 bases; sum-of-contigs



* NOTE: This is a 'working draft' sequence. It currently
* consists of 3 contigs. Gaps between the contigs
* are represented as runs of N. The order of the pieces
* is believed to be correct as given, however the sizes
* of the gaps between them are based on estimates that have
* provided by the submittor.
* This sequence will be replaced
* by the finished sequence as soon as it is available and
* the accession number will be preserved.
*        1 107546: contig of 107546 bp in length
*   107547 107646: gap of unknown length
*   107647 153000: contig of 45354 bp in length
*   153001 153100: gap of unknown length
*   153101 201093: contig of 47993 bp in length.
FEATURES             Location/Qualifiers
     source          1..201093
                     /organism="Pan troglodytes"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9598"
                     /clone="CH251-517B22"
                     /clone_lib="CH251"
     misc_feature    1..107546
                     /note="assembly_fragment
                     clone_end:T7
                     vector_side:left"
     misc_feature    83861..201093
                     /note="clone overlaps with GenBank Accession Number
                     AC144499 clone RP43-108D19 (center project name esf)"
     misc_feature    107647..153000
                     /note="assembly_fragment"
     misc_feature    153101..201093
                     /note="assembly_fragment
                     clone_end:SP6
                     vector_side:right"
BASE COUNT    61806 a  36616 c  38060 g  64411 t    200 others
ORIGIN
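
The contig layout given in the COMMENT block of this record (three contigs separated by two gaps) can be checked against the record's other figures. The sketch below is an illustration only: the segment coordinates are copied from the record, while the names `SEGMENTS`, `segment_length`, and `locate` are hypothetical. It sums the contig lengths, sums the gap lengths, and reports which segment a given coordinate falls in.

```python
# Layout copied from the COMMENT block of AC144750 (1-based, inclusive).
SEGMENTS = [
    ("contig", 1, 107546),
    ("gap", 107547, 107646),
    ("contig", 107647, 153000),
    ("gap", 153001, 153100),
    ("contig", 153101, 201093),
]


def segment_length(segment):
    """Length of an inclusive 1-based interval."""
    _, start, end = segment
    return end - start + 1


def locate(position):
    """Return (kind, ordinal among segments of that kind) for a coordinate."""
    seen = {}
    for kind, start, end in SEGMENTS:
        seen[kind] = seen.get(kind, 0) + 1
        if start <= position <= end:
            return kind, seen[kind]
    raise ValueError(f"position {position} is outside the record")


contig_total = sum(segment_length(s) for s in SEGMENTS if s[0] == "contig")
gap_total = sum(segment_length(s) for s in SEGMENTS if s[0] == "gap")
print(contig_total, gap_total, locate(69717))
```

The contig total corresponds to the "sum-of-contigs" insert size in the summary statistics, the gap total corresponds to the "others" entry in the base count, and the coordinate 69717, where the alignment for this hit begins, falls in the first contig.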



Query Match 64.9%; Score 2792.4; DB 2; Length 201093; 

Best Local Similarity 99.2%; Pred. No. 0; 

Matches 2838; Conservative 0; Mismatches 16; Indels 6; Gaps 3;



Qy 1430 AGTCATGCTTATG CTGCTGGTG CCAGTCATTTGAAGAAAAACAGTC CTTGGAGGAAAAGC 14 8 9 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIII 

Db 69717 AGTCATG CTTATG CTG CTGGTG CCAGT CATTTGAAGAAAAACAGTC CTTGGAGGAAAAGC 69658 



Qy 



Db 



14 90 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 154 9 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIMIIIIIII 

69657 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 69598 



Qy 155 0 AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 1609 

I ( II .11 II II M il I MM I I III I! II! I il 

Db 69597 AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATCTCATTTTCTTTATATTGGACC 69538 

Qy 1610 GAAGTCATTAAAACAAAATGAAACATTTGCCAAAACAAA^ 1669 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIII 

Db 69537 GAAGTCATTAAAACAAAATGAAACATTTGCCAAAACA^ 69478 

Qy 167 0 CAGCACACTATTAAAATATTAAGTGTAATTATTTTAAC^ 172 9 

IIIMIMMIIIIIMIIIIIIIMIIMIIIMIIIIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 69477 CAG(^CACTATTAAAATATTAAGTGTAATTATTTTAACACT(^VTAGCTACATATGACATT 69418 

Qy 173 0 TTATGAGCTGTTTACGG CATGGAAAGAAAATCAGTGGGAATTAAGAAAG CCTCGTCGTGA 178 9 

IIIMIIIIIIIIMIMIIMIIIIIMIIIMIIIIIIIIIIIIIIIIIII IIIMI 

Db 69417 TTATGAGCTGTTTACGG CATGGAAAGAAAATCAGTGGGAATTAAGAAAG C CTCATCGTGA 69358 



Qy 


1790 


Db 


69357 


Qy 


1850 


Db 


69297 


Qy 


1910 


Db 


69237 


Qy 


1970 


Db 


69177 


Qy 


2030 


Db 


69117 


Qy 


2090 


Db 


69057 


Qy 


2150 


Db 


68997 


Qy 


2207 


Db 


68937 


Qy 


2267 


Db 


68877 


Qy 


2327 


Db 


68819 


Qy 


2387 


Db 


68759 


Qy 


2447 


Db 


68699 


Qy 


2507 


Db 


68639 


Qy 


2567 


Db 


68579 


Qy 


2627 



AAGCACTTAATTTTTTACAGTTAGC^CTTCAAC^TAGCTCTTAAC^CTTCCAGGATATT 184 9 

i I IMM MM I II M II IM MMI I MMM IM M I 

AAGCACTTAATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATT 692 98 
CACACAACACTTAGGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAG 1909 

I IMIIMI MM I MM I! MMI MMI llllllllllllllllll 

CACACAACACTTAGGCTTAAAAATGAGCTCACTCAGAATTTTTATTCTTTCTAAAAAGAG 69238 
ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 1969 

IIIMIIIIMIIIIIIIIIIIIIIIIMMIIIIIIIMIIIIIIIIIIIIMIIIMI 

ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 69178 
ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 2 02 9 

M I II M II I II I M MMI IM M II MMI M I 

ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 69118 



1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 1 M 1 1 1 1 1 III lllllllllllllllllllllllllll 



IMIIMMMMMI llllll IMIM III llllll I lllllll lllllll I II II 



AGCAAGGCTGTTTTTGAAAATCATTACA CTTTCACTAGAAGCCCAAACCTCAGCATT 2206 

M IMIIMI ill Mill! I IMMMMMIi I IIMIMM 

AGCAAGGCTGTTTTTGAAAATCATTACACTCCTTTCACTAGAAGCCCAAACCTCAGCATT 68 938 
CTGCAATATGTAACCAACATGTCACAAACAAGC^ 2266 

IIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIMIIIIIIIIMIIIIIMMIII 

CTGCAATATGTAACCAACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCC 68878 
AGCTGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTT 2326 

I II II I! II II II I I Ml III II II IMM I II 

AGCTGAATTTAAAATATAATACTTTTTTTA - - AAAATTATTACATCCTTTACATTCAGTT 68820 



M MMI M II II M MMM II II Ml IM II MM 1 1 1 



II lllllll III MM lllllll III Mil III MM III lllllll lllllll III III 2446 

GAATCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATT 687 00 
AAAATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATA 2 5 06 

Mill M II II M IM I III M Ml llllll IM II II 

AAAATCTTCTTCTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATA 6864 0 
CTTACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTT 2566 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTTACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTT 6858 0 
TACGATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTC 2626 

1 IMMMIIMM MMI MMM MMM IMIM I II I MMI 

TACGATGGAGAGATG C CAGTGACCTCATAATAAAGACTGTGAACTG C CTGGTG CAGTGTC 68520 
2627 CACATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCT 2686 



Db 


68519 


Qy 


2687 


Db 


68459 


Qy 


2747 


Db 


68399 


Qy 


2807 


Db 


68339 


Qy 


2867 


Db 


68279 


Qy 


2927 


Db 


68219 


Qy 


2987 


Db 


68159 


Qy 


3047 


Db 


68099 


Qy 


3107 


Db 


68039 


Qy 


3167 


Db 


67979 


Qy 


3227 


Db 


67919 


Qy 


3287 


Db 


67859 


Qy 


3347 


Db 


67799 


Qy 


3407 


Db 


67739 


Qy 


3467 



1 1 1 II 1 1 1 1 II 1 1 1 I II I INI 1 1 II MM 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 II II! 1 1 1 II 

C^CATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCT 68460 
AGCATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACAT 2746 

IIIIIIIIMIMIIIIMMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

AGCATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACAT 68400
TTAACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATA 2806

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

TTAACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATA 68340
GAAAAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGAC 2866

IIIIIIIIIIIMIIIIIIMIIIIIIMilllllllMIIIIIMIIIIIIMIMIII 

GAAAAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGAC 68280 
CGTAAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTG 2926 

MM . iiM! , IM ' : II ■ , I M ! 

CGTAAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTG 68220 



II M, , ^ 1 M 1 1 



GAGGAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAAC 3046

II MINIMI I II MINIMI I IMIIIIIIII II III I MINIMI MM III II I 

GAGGAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAAC 68100
GTTCGTCATTGCCTCGTCACATCAATGCAAAAGGTC 3106 

I I I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 

GTTCGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAA 68040

GTGCAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCT 3166 

I II III I Ml II MUM II I IMM MIMMIII IM I I 

GTGCAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCT 67980

TAAAATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAA 3226

I Ml II M' IM M II I II I II l| MMI M MM I I 

TAAAATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAA 67920
ATAAGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACC 3286 

I . i I I i I . ' ; I . i ' . ! ! .1 , I 

ATAAGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACC 67860 
CAACAATGTGGCCAGAAAGAAAGAGCAATAATAAT^ 334 6 

I Ml II M, IM M IM ; II MM II MUM M M I M 

C^C^TGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTA 67800 
TTTATAAAT(^CC(^CAAACTTGTTCTTTAATTTCATCCC^TCACTTTTTCAGAGGCCT 34 06 

NNNNNNNNNNINN IIIIIIIINIIIIIIIIIINIIINIIIINI 

TTTATAAATCACCCACAAACTTGTTTTTTAATT^ 67740 

GTTATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTT 3466 

M ll:MI II , I I II .1 ' II I I ' I j II 111:111,1' II 

GTTATCATAGAAGACATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTT 67680 

TCACAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGT 3526 

1 II M IMM MM IMIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIII 



Db 


67679 


Qy 


3527 


Db 


67619 


Qy 


3587 


Db 


67559 


Qy 


3647 


Db 


67499 


Qy 


3707 


Db 


67440 


Qy 


3767 


Db 


67380 


Qy 


3827 


Db 


67320 


Qy 


3887 


Db 


67260 


Qy 


3947 


Db 


67200 


Qy 


4007 


Db 


67140 


Qy 


4067 


Db 


67080 


Qy 


4127 


Db 


67020 


Qy


4187


Db 


66960 


Qy 


4247 


Db 


66900 



67679 TCACAGTTTATTAATATATTTTATTTCTATTTAAATTTTAGATTATTTTTATTACCATGT 67620
ACTGAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATT 3586 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIMIII 

ACTGAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATT 67560 
ATCTTG CCAAATTTTGAAACTACACACAAAAAG CATACTTG CATTATTTATAATAAAATT 3646 

IMM M II II I lllllll MMM II MM III III IM 

ATCTTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATT 67500 
GCATTC^GTGGCTTTTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAA 3706 

II II Ml II I MIMM M IM II II I Ml IM I 

GC^TTCAGTGGCTTTTT-AAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAA 67441 
GAAACAATTATAATTTCTTTA(^TACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCA 3766 

Mill I ill II: I ill II Mill II Mill Ml nr I 

GAAACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCA 67381 
ACTTCAAAACATGTTT CCTAGTATTAAGGACTTTAATATAG CAACAGACAAAATTATTGT 3826 

Mill I III II I Ml II Mill II IMM II Ml I 

ACTTCAAAACATGTTT C CTAGTATTAAGGACTTTAATATAG CAACAGACAAAATTATTGT 67321 
TAACATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTAT 3886 

II III I II! MIM Ml II Mill II Ml II IMMI M 

TAACATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTAT 67261 
TATCCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATG 3946 

IIIIIIIMIIIIIIIIIIIMIIIIIIMIIIIMIIIIMIIIIIIIIIIIMIMII 

TATCCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATG 67201 
GCCAAAGGAATACAGTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTG 4 006 

Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I! 1 1 Ml I Ml I I MINI II II 1 1 1 II II 1 1 1 1 II 1 1 

G CCAAAGGAATACAGTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAACTG 67141 
TAATATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCT 4 066 

M II I: IMMI M II IMM IMM! IM I III MM I 

TAATATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCT 67081 



CTACCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAG 

II IMM II: II M M II IMM IIMIMM IM II II 

CTACCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAG 
AAATATAAATGTGACAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAGTAT 

Ml II Mill II M IM III II IMMI II I III MM I 

AAATATAAATGTGAC^GTGGAC^TTATTTATGTTAAATATACAATTATC^GCAAGTAT 
GAAGTTATTCAATTAAAATGCCACATTTCTGGTCTCTGGG 4286 

IIIIIMIIIIIMMIIIIIIIMIIMIIIIMIIIII 



67021 



RESULT 10 
G06417 



LOCUS       G06417 2720 bp DNA linear STS 19-OCT-1995
DEFINITION  human STS WI-7149, sequence tagged site.
ACCESSION   G06417
VERSION     G06417.1 GI:859662
KEYWORDS    STS; STS sequence; primer; sequence tagged site.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 2720)
  AUTHORS   Hudson,T.
  TITLE     Whitehead Institute/MIT Center for Genome Research; Physically
            Mapped ESTs
  JOURNAL   Unpublished (1995)
COMMENT     Contact: Thomas Hudson
            Whitehead Institute/MIT Center for Genome Research
            Whitehead Institute for Biomedical Research
            9 Cambridge Center, Cambridge MA 02142 USA
            Tel: 617 252 1900
            Fax: 617 252 1902
            Email: thudson@genome.wi.mit.edu

            Primer A: ATGGAGAGATGCCAGTGACC
            Primer B: TAGGCAGGAACGCACAAAG
            STS size: 331
            PCR Profile:
            Presoak:
            Denaturation:
            Annealing: 56 degrees C
            Polymerization:
            PCR Cycles: 35
            Thermal Cycler:
            Protocol:
            Template: 10 ng
            Primer: each 5 pM
            dNTPs: each 4 nM
            Taq Polymerase: 0.025 units/ul
            Total Vol: 20 ul

            Buffer:
            MgCl2: 1.5 mM
            KCl: 50 mM
            Tris-HCl: 10 mM
            pH: 9.3

            Prepared with primer pairs derived from D90402 -- Unigene.
FEATURES             Location/Qualifiers
     source          1..2720
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /map="710_D_4; 788_D_1; 795_F_4; 921_F_2; 940_G_8;
                     969_D_1"
     STS             1005..1335
     primer_bind     1005..1024
     primer_bind     complement(1317..1335)
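The feature coordinates above can be cross-checked against the primer and STS-size entries in the COMMENT block. The following is a minimal sketch, not part of the search output: the helper names `span_length` and `reverse_complement` are hypothetical, and the primer strings are copied from the record with OCR spacing removed.

```python
# Sketch: check that GenBank feature spans agree with the primer lengths
# and STS size quoted in the record. Helper names are illustrative only.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def span_length(start, end):
    """Length of an inclusive, 1-based GenBank location start..end."""
    return end - start + 1


def reverse_complement(seq):
    """Reverse complement of a DNA string containing only A, C, G, T."""
    return seq.translate(_COMPLEMENT)[::-1]


primer_a = "ATGGAGAGATGCCAGTGACC"  # Primer A, binds 1005..1024
primer_b = "TAGGCAGGAACGCACAAAG"   # Primer B, binds complement(1317..1335)

print(span_length(1005, 1335))                  # compare with "STS size"
print(span_length(1005, 1024), len(primer_a))   # forward primer site
print(span_length(1317, 1335), len(primer_b))   # reverse primer site
# Top-strand sequence expected at 1317..1335 for a complement-strand primer:
print(reverse_complement(primer_b))
```

Because Primer B is annotated on the complement strand, it is its reverse complement, not the primer itself, that should appear in the Db line of the alignment around positions 1317 to 1335.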



BASE COUNT 894 a 438 c 408 g 870 t 110 others 

ORIGIN 



Query Match 60.7%; Score 2610; DB 11; Length 2720; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 2610; Conservative 0; Mismatches 110; Indels 0; Gaps 0; 
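The summary figures can be sanity-checked with a little arithmetic. This sketch assumes, since the report does not state it, that "Best Local Similarity" is the number of matches divided by the number of aligned columns (matches + mismatches + indels); that assumption reproduces the percentage printed for this result. The function name is hypothetical.

```python
# Sketch under an assumed definition: similarity = matches / aligned columns.
# The search program's actual formula is not given in the report.

def best_local_similarity(matches, mismatches, indels):
    """Percent identity over all aligned columns."""
    aligned_columns = matches + mismatches + indels
    return 100.0 * matches / aligned_columns


# Counts taken from the "Matches ... Mismatches ... Indels" line above.
similarity = best_local_similarity(matches=2610, mismatches=110, indels=0)
print(f"{similarity:.1f}%")
```

With no indels, the aligned-column count (2610 + 110) also equals the database sequence length of 2720, which is consistent with the whole STS record being covered by the alignment.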

AAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAA 1626

II II IMMI II Ml M IIIMM IM IM I II I II 

AAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTNNNNNNNN 60

ATGAAACATTTGCCAAAACAAAACAAA 1686 

M III I MM I IIIMIIIII 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTATGTATTTGCACA.GCACACTATTAAAAT 120 

ATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGTTTACGG 1746 

i I ; I ill ! 1 1 1 1 ' 1 1 . I II ll.ll 1 1 1 ! 1 : 1 1 1 1 : l , (I j I 1 1 ' 

ATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGCTGTTTACGG 180 
CATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAATTTTTTA 1806 

I I Ml II ir II- ; M inn M I' II :m ii ii ii 

CATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAATTTTTTA 240
CAGTTAGCACTTCAACATAGCTCTTAACAACTTC^ 1866 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIMI 

CAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACAC^ 300 
TAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTAAATCAA 1926

II II IMIIMM III Mlllllill IM IM I MM I Mllllllll 

TAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTAAATCAA 360 
TGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATGAAGCTT 1986 

IIMMIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIII 

TGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATGAAGCTT 420 
AAATTACTC^TTTAAAATTTTAAAATCCTTTAAAAC^CTTTTC^TTAATATTATCAC 2 046 

I II IMMI I IMM II Mill II M II Mi II II M 

AAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATATTATCAC 480
ACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCATTTTTCG 2106 

IIMIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

ACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCATTTTTCG 540
GACACTGGAAACATTTAAATGATC^GGAGGGAGTAAC^GAAAGAGC^AGGCTGTTTTTGA 2166 

M M IMMI M II I II II M MMM II I IIMIIMI 

GACACTGGAAAOVTTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGTTTTTGA 600 
AAATCATTACACTTTCACTAGAAGCCCAAACCT 2226 

M M IMMI II illlll Ml MM I II IMIMIMMM II M 

AAATCA/TTACACTTTCA-CTAGAAGCCCA^ 660 
GTCACAAACAAGCAGCATGTAACAGACTC 2286 

M II Ml II M IMM II II M IMMI II M II Ml II 

GTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAAT 720 
ACTTTTAAAAAGAAAATTATTACATCCTTTACATTC^GTTAAGATCAAACCTC^CAAAGA 2346 

MIIIIMIIIIIMIIIIIIIIIIMMIIIMMIIMIIIIIIIIIIIIIIIIIIII 



Qy 


1567 


Db 


1 


Qy 


1627 


Db 


61 


Qy 


1687 


Db 


121 


Qy 


1747 


Db 


181 


Qy 


1807 


Db 


241 


Qy 


1867 


Db 


301 


Qy 


1927 


Db 


361 


Qy 


1987 


Db 


421 


Qy 


2047 


Db 


481 


Qy 


2107 


Db 


541 


Qy 


2167 


Db 


601 


Qy 


2227 


Db 


661 


Qy 


2287 



Db 


721 


Qy 


2347 


Db 


781 


Qy 


2407 


Db 


841 


Qy 


2467 


Db 


901 


Qy 


2527 


Db 


961 


Qy 


2587 


Db 


1021 


Qy 


2647 


Db 


1081 


Qy 


2707 


Db 


1141 


Qy 


2767 


Db 


1201 


Qy 


2827 


Db 


1261 


Qy 


2887 


Db 


1321 


Qy 


2947 


Db 


1381 


Qy 


3007 


Db 


1441 


Qy


3067 


Db 


1501 


Qy 


3127 


Db 


1561 



721 ACTTTTAAAAAGAAAATTATTACATC 780 
GAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCACATACC 2406

I II M II I IM I M II Ml II I II I IM M II 

GAAATAGAATGTTTGAAAGGCTATCCC^yVAAGACTTTTTTGAATCTGTCATTCACATACC 84 0 
CTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTTTCACTA 2466

MM II MUM MUM II II IM II II M II M IM M II 

CTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTTTCACTA 900 
TCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACACTGCAT 2526 

MUM I II I II II MUM IMIIMM MMI Ml MMM 

TCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACACTGCAT 960 
GTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGATGCCAGT 2586

I I II Mill; I MM I II IM II II MMI M IM MM 

GTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGATGCCAGT 1020
GACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGGGCAGGT 2646 

IMIIIIIMIIIIIMIMIIIIIIIMIMIIMIIIIIIIIMIIMIIIIMIMI 

GACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGGGCAGGT 1080 
AGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAATGCTATA 2706 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 

AGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTATAATGCTATA 1140
GTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTGTAAAGC 2766 

Ml MMM MMM IMMMIMM M M MMM IMIMMIM 

GTTAAAATACTATTTTTCAAAATCATACAGATTAGTACATTTAACAGCTACCTGTAAAGC 1200
TTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTGACATGG 2826

MMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIMIIII 

TTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTTGCTTGACATGG 1260 
TGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTTAGCTTT 2886

M IM I MMM IM M M I II M M IM MMM MM 

TGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAACCTCTTAGCTTT 1320
GTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTTGGGATG 2946

I M II MMM MMI I II IMIMMIMI IMMI III I IM 

GTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGGATAGCTTGGGATG 1380 
AGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTGGGGTTG 3006 

IIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIMIIIIIIIIIIIMIMIIMMIII . 

AGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAATGAGGTGGGGTTG 1440
GAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCTCGTCAC 3066

! M IM MMM MMIMM II M IIMMMI l!,|| Ml MlM 

GAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCTCGTCAC 1500
ATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAATGTTCTCAGAGTGA 3126

I M M, D IM I IM I. I! M II M 1 IM MMI MMMM 

ATCAATGCAAAAGGTCCTGATTTTGTTCCAGC^AACACAGTGCAATGTTCTCAGAGTGA 1560 
CTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAAATTTTT 3186 

M M M Mill MIMIMM II M MMI MM MMI MMMM 

CTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAAATTTTT 1620
3187


Db 


1621 


Qy 
Db
Qy


3307 


Db 


1741 


Qy 


3367 


Db 


1801 


Qy 


3427 


Db 


1861 


Qy 


3487 


Db 


1921 


Qy 


3547 


Db 


1981 


Qy 


3607 


Db 


2041 


Qy 


3667 


Db 


2101 


Qy 


3727 


Db 


2161 


Qy 


3787 


Db 


2221 


Qy 


3847 


Db 


2281 


Qy 


3907 


Db 


2341 


Qy 


3967 


Db 


2401 



ACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATGTTGTTT 3246 

IIIIIMIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIlll 

ACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATGTTGTTT 1680 

TCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCAGAAAGA 3306 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 

TCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCAGAAAGA 1740

AAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCCACAAAC 3366 

MM M MM II Mill Ml II I I Mi Mill II II MM 

AAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCCACAAAC 1800 

TTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGTCATTTT 3426

! ; i : i i i . : . I ; ! I : - 

TTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGTCATTTT 1860 

AGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCACAGTTTATTAATATATT 3486

MMM Mill MUM I II 

AGACTCTCAATTTTAAATTAATTTTGAATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1920 



1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 ! I 



TACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTTTGAAAC 3606 

MM IM MM II MMlMI II; I II MMM III M M MM 

TACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAATTTTGAAAC 2040
TACACACAAAAAG CATACTTG CATTATTTATAATAAAATTG CATTCAGTGGCTTTTTAAA 3666 

I IMMM'M II MM:MI II I II IMMI II II IM 

TACACACAAAAAG CATACTTG CATTATTTATAATAAAATTG CA.TTCAGTGGCTTTTTNNN 2100 
AAAAATGTTTGATTCAAAACTTTAA(^TACTGATAAGTAAGAAA(^ATTATAATTTCTTT 3726 

! - ; i . ; , : ; . i i , i i 

NNNNNTGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTATAATTTCTTT 2160 
ACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTC^^ 3786 

IMM III M IMII M I IMM Ml I IIMI II MM MM 

ACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACATGTTTCCTA 2220



IMIM I I Ml !M I MMI IM I I MIMI M M II II 



AAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAATGTGGAT 3906 

M M I MMM II M M II III I II IMM MM III II II 

AAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCTAATGTGGAT 2340
GTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAA 3966 

I II M M MMM I M Ml II Mi MMI IM MM, Mill 

GTATGTTC^AACACCTTTTAGTATTGATAGCTTACATATGGCO^GGAATAC^GTTTAT 24 00 
AGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGTAAAAAA 4026

Mill MMI II MMM II III MMI IMMMMIMM 

AGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAATGTAAAAAA 2460 



Qy 4027 TTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTTTTTATT 4086

I II MliILM M II I Mi I III! I II: II II 

Db 2461 TTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGATTTTTTATT 2520 

Qy 4087 ATGTAAGCAAAA^ 4146 

Db 2521 ATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTAACAACTACCTTATTTTTCACTGTA 2580 

Qy 4147 CAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGACAAGTG 4206

MIIIIIMIIMIIIIIIIIIIIMIIIMIIMIMMIMIMIIIIIIIIIIIIII 

Db 2581 CAGACACTAATTCATTAAATACTAATTGATTGTTTAAAAGAAATATAAATGTGACAAGTG 2640

Qy 4207 GACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATTAAAATG 4266

MIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIII IIIIIIMII 

Db 2641 GACATTATTTATGTTAAATATACAATTATCAAGCAAGTATGAAGTTATTCAATTAAAATG 2700 

Qy 4267 CCACATTTCTGGTCTCTGGG 4286 

MIIIIIIMIIIIIIIIII 

Db 2701 CCACATTTCTGGTCTCTGGG 2720
AC130785 169751 bp DNA linear HTG 29-AUG-2002

Papio anubis clone RP41-325P5, WORKING DRAFT SEQUENCE.

AC130785

AC130785.1 GI:22218455
HTG; HTGS_PHASE2; HTGS_DRAFT.
Papio anubis (olive baboon)
Papio anubis

Eukaryota; Metazoa; Chordata;
Mammalia; Eutheria; Primates;
Cercopithecinae; Papio.
Center: NIH Intramural Sequencing Center
Center code: NISC 
Web site: http://www.nisc.nih.gov 
Contact: nisc_zoo@nhgri.nih.gov 

Project Information 

Center project name: deh 
Center clone name: 325P05 



The sequence data in this record represents an 'enhanced'
version of a Phase 2 submission. Specifically, the indicated 
order and orientation of each sequence contig has been 
established using one or more of the following: read-pair 
data from individual subclones, overlaps with neighboring 
clones, alignment with available reference sequence (e.g., 
human), and/or confirmation by PCR testing. In addition, 
the sequence assembly is based on at least 8X average 
coverage in Q20 bases and has been reviewed to rule out
gross misassemblies, the low-quality ends of sequence
contigs have been trimmed away, and each base is associated 
with a Phrap-derived quality score. 

Summary Statistics 

Sequencing vector: plasmid; n/a; 100% of reads 
Chemistry: Dye-terminator Big Dye; 100% of reads 
Assembly program: Phrap; version 0.990319 



Consensus quality: 169735 bases at least Q40
Consensus quality: 169747 bases at least Q30
Consensus quality: 169749 bases at least Q20
Insert size: 138000; agarose-fp
Insert size: 169751; sum-of-contigs
Quality coverage: 10.99x in Q20 bases; agarose-fp
Quality coverage: 8.94x in Q20 bases; sum-of-contigs
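The Q20, Q30 and Q40 thresholds in the summary statistics are Phred-scaled quality values, where Q = -10 log10(P) and P is the estimated probability that a base call is wrong. The sketch below is an illustration added for the reader, not output of the sequencing center's pipeline, and the function names are hypothetical.

```python
# Sketch: convert Phred quality thresholds to per-base error probabilities
# and express a consensus-quality count as a fraction of the assembly.

def phred_to_error_probability(q):
    """Error probability implied by a Phred quality score q."""
    return 10 ** (-q / 10)


def fraction_at_least(bases_at_threshold, total_bases):
    """Share of the consensus that meets a given quality threshold."""
    return bases_at_threshold / total_bases


for q in (20, 30, 40):
    print(f"Q{q}: error probability {phred_to_error_probability(q):g}")

# Counts from the record above: 169735 of 169751 bases are at least Q40.
print(f"{fraction_at_least(169735, 169751):.4%} of bases at Q40 or better")
```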



NOTE: This is a 'working draft' sequence. It currently
consists of 1 contigs. Gaps between the contigs
are represented as runs of N. The order of the pieces
is believed to be correct as given, however the sizes
of the gaps between them are based on estimates that have
provided by the submittor.
This sequence will be replaced
by the finished sequence as soon as it is available and
the accession number will be preserved.
1 169751: contig of 169751 bp in length.

FEATURES             Location/Qualifiers
     source          1..169751
                     /organism="Papio anubis"
                     /mol_type="genomic DNA"
                     /sub_species="anubis"
                     /db_xref="taxon:9555"
                     /clone="RP41-325P5"
                     /clone_lib="RP41"
     misc_feature    1..169751
                     /note="assembly_fragment
                     clone_end:T7
                     vector_side:left
                     clone_end:Sp6
                     vector_side:right"
     misc_feature    1..63149
                     /note="clone overlaps with GenBank Accession Number
                     AC129069 clone RP41-240D13 (center project name deg)"

BASE COUNT 51109 a 31316 c 31981 g 55345 t 

ORIGIN 



Query Match 59.3%; Score 2550; DB 2; Length 169751; 

Best Local Similarity 94.8%; Pred. No. 0; 

Matches 2717; Conservative 0; Mismatches 130; Indels 19; Gaps 7; 

AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 1489

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIMIIIIMIIIIIIIIMIIIIIII 

AGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 29159
AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 1549

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 29099 
AATACAGCTC^TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 1609 

I II II IMMI MMI I II I Ml ! MMI I Ml I II 

AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 29039
GAAGTCATTAAAACAAAATGAAACATTTC 1669 

M MMII I Mill IIMMMI IIMIIIIIIIIIIIIIIIIIIIIIIIIII 

GAAGT CATTAAAACAAAATGAAA CATTTG T CAAAACAAAACAAAAAA CTATGTATTTG CA 28979 
C^GCACACTATTAAAATATTAAGTGTAATTATTTTAACACT 172 9 

M II II II Ml I Ml I III II II M I I M III M I 

CAGC^CACTATTAAAATATTAAGTGTAATTATTTTAAC^CTCATAGCTACATATGACATT 2 8 919 
TTATGAG CTGTTTACGG CATGGAAAGAAAATCAGTGGGAATTAAGAAAG CCTCGTCGTGA 178 9 

IIIMIIIIIIIIIIIIIIIIIIIIIIMIIMIMI I 1 1 1 1 

TTATGAGCTGTTTACAGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCATTGTGA 28859 
AAGC^CTTAATTTTTTACAGTTAGCACTTCAAO^TAGCTCTTAACAACTTCCAGGATATT 184 9 

MUM Ml II M III II II MMI I MMII M I 

AAGCACTTACTTTTTTATGGTTAGCACTTCAACATAGCTCTTAATAACTTCCAGGATATT 2 8799 
(^CACAACACTTAGGCTTAAAAATGAGCTO^CTCAGAATTTCTATTCTTTCTAAAAAGAG 1909 

Illlllll lllllllllllllllllllllllll lllllllllll Mill - 

CACACAACCCTTAGGCTTAAAAATGAGCTCACTCGGAATTTCTATT TAAGAG 28747 

ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 1969 

IIIIIIIIIMIIMIIII II lllllll MINIMI 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 

ATTTATTTTTAAATCAATGTGAATCTGATACAAAGGAAGAGTAAGTCACTGTAAAACAGA 28687 
ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 2 02 9 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMII II MMII M Ml II MMII 

ACTTTTAAATGAAGCTTAAATTACCCAATTTGAAATTTTAAAATCCTTTAAAAGAACTTT 28627 
TCAATTAATATTATCACACT - ATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTT 2088 

I M M 1 1 1 M I lllllll I MMII 1 1 1 1 1 1 J I J 1 1 1 1 1 1 f 1 1 1 1 1 1 J J MM 

TTAATTAATATTTTCACACTG CTGATCAGACTGTAATTAGATG CAAATGAGAGAGTAGTT 28567 
TAGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAA 214 8 

MINI II MINI lllllll IMIIIIIMMIIIIIIIMIIIIIIMI MM 

TAGTTGCTGTATTTTTTGGACACTAGAAACATTTAAATGATCAGGAGGGAGTAACTGAAA 28507 
29098


Qy 


1610 
29038


Qy 


1670 


Db 


28978 


Qy 


1730 


Db 


28918 


Qy 


1790 


Db 


28858 


Qy 


1850 


Db 


28798 


Qy 


1910 


Db 


28746 


Qy 


1970 


Db 


28686 


Qy 


2030 


Db 


28626 


Qy 


2089 


Db 


28566 



Qy 214 9 GAGCAAGGCTGTTTTTGAAAATCATTACA CTTTCACTAGAAGCCCAAACCTCAGCAT 2205 

1 1 IIIIIIMIIMIMIIIIIIIIIII MMMMMMMMMMIMIMM 

Db 28506 GAACAAGGCTGTTTTTGAAAATCATTACACTCCTTTCACTAGAAGCCCAAACCTCAGCAT 28447 

Qy 22 06 TCTGCAATATGTAACCAACATGTCACAAACAAGCAGCATGTAACAG 2265 

lllllllllllllllllllllll lllllllllllllllllllll IIIIIIMIIIII 
Db 28446 TCTGCAATATGTAACCAACATGTTACAAACAAGCA^ 28387 

Qy 2266 CAGCTGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGT 2325 

1 1 1 1 III IIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIII I II I! 

Db 28386 CAGCCAAATCTAAAATATAATACTTTTAAAAAGAAAATTATTACACCCTTTACATTCAGA 28327

Qy 2326 TAAGATCAAAC CTCACAAAGAGAAATAGAATGTTTGAAAGG CTATCCCAAAAGACTTTTT 2385 

1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 I M 1 1 !l 1 1 1 1 1 1 llllllllllllll 

Db 28326 TAAGATCAAAC CTCACAAAGAGAAATAGAATGTTTGAAAGG CTG CCCCAAAAGACTT CTT 28267 

Qy 2386 TGAATCTGTCATTC^C^TACCCTGTGAAGAC^ATACTATCTACAATTTTTTCAGGATTAT 2445 

llllllll llllllll I IMIIIIIII llllllllllllll IIIIIIIIIIMM 

Db 2 8266 TGAATCTGCCATTC^CACAGCCTGTGAAGAAAATACTATCTAOW^TTTTTCAGGATTAT 28207 

Qy 2446 TAAAATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAAT 2505

llllllllllll Illllllll III MINN III III! Ill MM III III MUM 

Db 28206 TAAAATCTTCTTCTTTCACTATTGTAGCTTAAACTCTGTTTGGTTTTGTCATCCGTAAAT 28147 

Qy 2506 ACTTACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCT 2565

Mill MIIIIIIIIMIIIIMM IMMM IIMM MIMIIII MUM I 

Db 28146 ACTTAGCTACATACACTGCATGTAGACGATTAAACGAGGGCGGGCCCTGTGTTCATAGTT 28087

Qy 2566 TTACGATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGT 2625 

I II I I I I II I I I II I I I I I I M I I I II I I II I I I I I I I I I I I I I I I I I I I M I III 
Db 28086 TTACAATGGAGAGATGCCAGTGACCTCATAATAGAGACTGTGAACTGCCTGGTGCGATGT 28027

Qy 2626 CCACATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTC 2685

MIMIIMM MMMMMMMMIMMMMIMIMM llllllllllllll 

Db 28026 CCACATGACAAGGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTAGTTAAAATGGTTTC 27967

Qy 2686 TAGCATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACA 2745 

MMMMMMMIMI Illllllll III MM II III MM III Mil III III 

Db 27 966 TAG CATATGTATAATG CTGTAGTTAAAACACTGTTTTGCAAAATCATACAGATTAGTACA 27907 

Qy 2746 TTTAACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAAT 2805

Mill lllllllllllllllllllllll MMMMMMMMMMMMIMM 

Db 27906 TTTAATGGCTACCTGTAAAGCTTATTACTAGTTTTTGTATTATTTTTGTAAATAGCCAAT 27847

Qy 2806 AGAAAAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGA 2865

llllllll llllllll MIMIIMIMIIM lllllllllllllllllllllll 

Db 27846 AGAAAAGTGTGCTTGACGTGGTGCTTTTCTTTCACTTAGAGGCAAAACTGCTTTTTGAGA 27787 

Qy 2866 CCGTAAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGT 2925

I I II I Ml M MINIMI IM I M! M MINIMUM 

Db 27786 CTGTAAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTAAATCTTCTAAGCAAAGT 27727

Qy 2926 GCCTTAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAG 2985

IMMM M : II II II II I II M IN M II I Ml INN 

Db 27726 GCCTTAGAATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAG 27667 

Qy 2986 AGAGGAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAA 3045



Db 


27666 


Qy 


3046 


Db 


27606 


Qy 


3106 


Db 


27546 


Qy 


3166 


Db 


27486 


Qy 


3226 


Db 


27426 


Qy 


3286 


Db 


27366 


Qy 


3346 


Db 


27306 


Qy 


3406 


Db 


27246 


Qy 


3466 


Db 


27186 


Qy 


3525 


Db 


27126 


Qy 


3585 


Db 


27066 


Qy 


3645 


Db 


27006 


Qy 


3705 


Db 


26947 


Qy 


3765 


Db 


26887 


Qy 


3825 



Illllllllllllllllll llllllll : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M I ll 1 1 1 

AGAGGAAATGAGGTGGGGTGAGAGGAAACTCATGGGGACAGATTCCCATTCTTAGCCTAA 27607 
CGTTCGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACAC 3105 

IMIIMIIIIIMIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIMIIIIII 

CGTTCGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACAC 27547
AGTGCAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTC 3165 

lllllllllllllllllllllllll IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIM 

AGTGCAATGTTCTCAGAGTGACTTTAGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTC 27487 
TTAAAATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGA 3225 

IIIIIMIIIIIIIIIIIIIIIIIII lllllllllll II I II 11:11 II 

TTAAAATATGCCCAAATTTTTACTTTTTTTTTCTTTTAGTAAACTGGGCCACATGTTGGA 27427 
AATAAGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAAC 3285 

I IIIIM I II MINI II MM llllllllllll llllllllllllll 

AATAAG CTAGTAATGTTGTTTT CTGTCAATATCGAATGTGATGGTGCAGTAAACCAAAAC 27367 
CCAACAATGTGG CCAGAAAGAAAGAG CAATAATAATTAATTCACACACCATATGGATTCT 334 5 

IIIIIMIIIIIIIIIIIIIMIMIIIMIII lllllllllll llllllllllll 

CCAACAATGTGG CCAGAAAGAAAGAG CAATAATGATTAATTCACATGCCATGTGGATTCT 27307 
ATTTATAAATCACCCAC^^CTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCC 34 05 

I 1 1 ; ! i 1 1 I 111,1 ll! I : illllllllllllllllll lllllllllllll 

ATTTATAAATCACCCACAAACTTGTTTTTTAATTTCATCCCAATCATTTTTTCAGAGGCC 27247 
TGTTATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATT 3465

llllllllllllll lllllllllll llllllllllllll MINIMUM 

TGTTATCATAGAAGACATTTTAGACTTGCAATTTTAAATTAACTTTGAATCACTAATATT 27187 

TTCACAGTTTATTAATATA - TTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCAT 3524 

: I III III llllllllllllllllllllllllllllllllllll 

TTCACAGTTTATTAATATATTTTTATTTCTATTTAAATTTTAGATTATTTTTATTACCAT 27127 

GTACTGAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAA 3584 

llllllllllllll M II I M 1 1 I Ml 

GTACTGAATTTTTATATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTGTAA 27067 
TTATCTTGCCAAATTTTGAAACTACACACAAAAAGCAT^ 3644 

lllllll M MM I IMMMI IIMM! I III MM 

TTATCTTACCAAATTTTGAAACTGCACACAAAAAG 27007 
TTGC^TTCAGTGGCTTTTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGT 37 04 

Illllllllllllllllll lllllllllllllllllllll II II II Ml 

TTGCATTCAGTGGCTTTTT - AAAAAAATGTTTGATTCAAAATTTTAACATACTGATAAGT 26948 
AAGAAACAATTATAATTTCTTTACATACT CAAAAC CAAGATAGAAAAAGGTG CTATCGTT 3764 

IIIIIIMM IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII II 

AAGAAACAATAATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATTATT 26888 
CAACTT CAAAACATGTTT CCTAGTATTAAGGACTTTAATATAG CAACAGACAAAATTATT 3824 

II II M Ml I II I II MMM III IMM II MM 

TAACTTCAAAACATGTTTCCTAGTATTAAGAACTTTAATATAGCAACAGAC^^ 26828 
GTTAACATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTT 3884 

MM lllllllllllll Ml ! Ml I MM MIMM I 



Db 



26827 GTTAACATGAATGTTACAGCTCAGAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTT 26768 



Qy 


3885 


Db 


26767 


Qy 


3945 


Db 


26707 


Qy 


4005 


Db 


26647 


Qy 


4065 


Db 


26587 


Qy 


4120 


Db


26527


Qy 


4180 


Db 


26467 


Qy 


4240 


Db 


26407 



ATTATCCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATA 3944 

IIIIIIIIIIIIMIIIIIIII IIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIII 

ATTATCCACTGCTAATGTGGATATATGTTCAAACACCTTTTAGTATTGATAGCTTACATA 26708 
TGGCC^AAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAG 4 004 

I I : I :i: h IIIMIIIIIII llllllllllllllllllll 

TGGCCAAAGGAATACAGTTTATAGTGAAACATGGGTATACTGTAGCTAACTTTATAAAAC 2664 8 
TGTAATATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGG 4064

I i M II II !! Ml Mill lllllllllllllll I'll 

TGTAATATAACAATGTAAAAAATTATATACCTGGGGGGATTTTTTGGTTGCTTAAAGTGG 26588
CTATAGTTACTGA - TTTTTTATTATGTAAGCAAAACCAATAAA AATTTAAGTTTTT 4119 

II II Mill II II l':IM I ill IIMI I II MIMI 

CTATAGTCACTGATTTTTTTATTATGTAAGCAAAACCAATAAACTTTAGGTTGTGTTTTT 26528 
TTAACAACTACCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGT 4179 

Mill 1 1 1 1 II I II II lllllll : 1 1 i . I J I 1 1 1 . 1 , : Ml 

TTAACAACTAGCTTATTTTTCATTGTACAGGCACTAATTCATTAAATACTAATTGACTGT 26468 
TTAAAAGAAATATAAATGTGACAAGTGGACATTATTTATGTTAAATATACAATTATCAAG 4239

IIMI III II IMIl: I! II I I II II III I I MIMI 

TTAAAGGAAATATAAATGTGACAAGTGGACACTATTTATGTTAAATATACAATCATCAAG 26408
CAAGTATGAAGTTATTCAATTAAAATGCCACATTTCTGGTCTCTGG 4285

Mill Ihlli II i. II I II III II II 

GAAGTATGAAGTTATTCAATTAAAATGCCACATTTCTGGTCTCTGG 26362 



RESULT 12 

AC129069/C 

LOCUS       AC129069 185870 bp DNA linear HTG 19-SEP-2002
DEFINITION  Papio anubis clone RP41-240D13, WORKING DRAFT SEQUENCE.
ACCESSION   AC129069
VERSION     AC129069.2 GI:23196382
KEYWORDS    HTG; HTGS_PHASE2; HTGS_DRAFT.
SOURCE      Papio anubis (olive baboon)
  ORGANISM  Papio anubis
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;
            Cercopithecinae; Papio.
REFERENCE   1 (bases 1 to 185870)
  AUTHORS   Akhter,N., Antonellis,A., Ayele,K., Beckstrom-Sternberg,S.M.,
            Benjamin,B., Blakesley,R.W., Bouffard,G.G., Brinkley,C., Brooks,S.,
            Cariaga,K., Coleman,B., Dietrich,N.L., Granite,S., Guan,X.,
            Gupta,J., Haghighi,P., Han,J., Hansen,N., Ho,S.-L., Idol,J.R.,
            Karlins,E., Laric,P., Lee-Lin,S.-Q., Legaspi,R., Maduro,Q.L.,
            Maduro,V.B., Margulies,E.H., Masiello,C., Maskeri,B.,
            Mastrian,S.D., McCloskey,J.C., McDowell,J., Paguirigan,C.,
            Pearson,R., Portnoy,M.E., Prasad,A., Reddix-Dugue,N.,
            Schueler,M.G., Sison,C., Stantripop,S., Thomas,J.W., Thomas,P.J.,
            Touchman,J.W., Vogt,J.L., Walker,M., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
  TITLE     NISC Comparative Sequencing Initiative
  JOURNAL   Unpublished



REFERENCE 2 (bases 1 to 185870) 
AUTHORS Green, E.D. 
TITLE Direct Submission 

JOURNAL Submitted (25-JUL-2002) NIH Intramural Sequencing Center, 8717 
Grovemont Circle, Gaithersburg, MD 20877, USA 
REFERENCE 3 (bases 1 to 185870) 
AUTHORS Green, E.D. 
TITLE Direct Submission 

JOURNAL Submitted (19-SEP-2002) NIH Intramural Sequencing Center, 8717
Grovemont Circle, Gaithersburg, MD 20877, USA
COMMENT On Sep 19, 2002 this sequence version replaced gi:21955004.

Genome Center 

Center: NIH Intramural Sequencing Center 
Center code: NISC 

Web site: http://www.nisc.nih.gov 
Contact: nisc_zoo@nhgri.nih.gov

Project Information 

Center project name: deg 
Center clone name: 240D13



The sequence data in this record represents an 'enhanced' 
version of a Phase 2 submission. Specifically, the indicated 
order and orientation of each sequence contig has been 
established using one or more of the following: read-pair 
data from individual subclones, overlaps with neighboring 
clones, alignment with available reference sequence (e.g., 
human) , and/or confirmation by PCR testing. In addition, 
the sequence assembly is based on at least 8X average 
coverage in Q20 bases and has been reviewed to rule out 
gross misassemblies , the low-quality ends of sequence 
contigs have been trimmed away, and each base is associated 
with a Phrap-derived quality score. 

Summary Statistics 

Sequencing vector: plasmid; n/a; 100% of reads 
Chemistry: Dye-terminator Big Dye; 100% of reads 
Assembly program: Phrap; version 0.990319 



Consensus quality: 184076 bases at least Q40
Consensus quality: 185363 bases at least Q30
Consensus quality: 185733 bases at least Q20
Insert size: 152000; agarose-fp
Insert size: 185870; sum-of-contigs
Quality coverage: 7.20x in Q20 bases; agarose-fp
Quality coverage: 5.89x in Q20 bases; sum-of-contigs

NOTE: This is a 'working draft' sequence. It currently
consists of 1 contigs. Gaps between the contigs
are represented as runs of N. The order of the pieces
is believed to be correct as given, however the sizes
of the gaps between them are based on estimates that have
provided by the submittor.
This sequence will be replaced
by the finished sequence as soon as it is available and
the accession number will be preserved.
1 185870: contig of 185870 bp in length.
FEATURES Location/Qualifiers
source 1..185870
/organism="Papio anubis"
/mol_type="genomic DNA"
/db_xref="taxon:9555"
/clone="RP41-240D13"
/clone_lib="RP41"
misc_feature 1..185870
/note="assembly_fragment
clone_end:T7
vector_side:left
missing approximately 55 bases, including Sp6 clone end,
on 3' end of insert"
misc_feature 121812..185870
/note="clone overlaps with GenBank Accession Number
AC130785 clone RP41-325P5 (center project name deh)"
BASE COUNT 55300 a 34781 c 35402 g 60387 t 
ORIGIN 



Query Match 59.3%; Score 2550; DB 2; Length 185870;
Best Local Similarity 94.8%; Pred. No. 0;
Matches 2717; Conservative 0; Mismatches 130; Indels 19; Gaps 7;
Qy 1430 AGTCATGCTTATGCTGCTGGTGCC^GTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGC 148 9 

IIIIIIIMIIIMMMIIMIIMIIIIIIIIIIMMIIIIIMIIMIIIIIIIII 

Db 151685 AGT CATG CTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTC CTTGGAGGAAAAG C 

151626 



Qy 14 90 AGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATA 154 9 

I Ml I Ml I MMll II II IIMI I III IM IMM I 

Db 15162 5 AGTCGTGCTTAAAGTTCAAAG CTAATGATCACGGATATGACAACTTC CGTTCCAGTAATA 

151566 



Qy 1550 AATACA.GCTCA.TCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 1609 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 

Db 151565 AATACAGCTCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACC 

151506 



Qy 1610 GAAGTCATTAAAACAAAATGAAACATTTGC 1669 

lllllllllllllllllllllllllllll IIIIMIIIIIIIIIIIIIIIIMIIMII 

Db 151505 GAAGTCATTAAAACAAAATGAAACATTTGTCAAAACAAAA 

151446 



Qy 1670 CAG CACACTATTAAAATATTAAGTGTAATTATTTTAACACTCACAG CTACATATGACATT 172 9 

I IMIIMI Ml M; IM MMIMM IIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 151445 CAGCACACTATTAAAATATTAAGTGTAATTATTTTAAC^CTC^ 

151386 



Qy 173 0 TTATGAG CTGTTTACGG CATGGAAAGAAAATCAGTGGGAATTAAGAAAG C CTCGTCGTGA 178 9 

MM II III II I II MMIMM IE IMM IM I II | I 

Db 151385 TTATGAG CTGTTTACAG CATGGAAAGAAAATCAGTGGGAATTAAGAAAG C CTCATTGTGA 

151326 



Qy 1790 AAGC^CTTAATTTTTTAC^GTTAGCACTTCAAC^TAGCTCTTAAC^CTTCCAGGATATT 184 9 

I Ml II II I II I MMll Ml IMIIIIIIIIIIII 

Db 151325 AAGCACTTACTTTTTTATGGTTAGCACTTCAACATAGCTCTTAATAACTTCCAGGATATT 

151266 



Qy 1850 (^CA(^CACTTAGGCTTAAAAATGAGCTO\CTCAGAATTTCTATTCTTTCTAAAAAGAG 1909 

IMIIMI I II M Ml I II I IM II II Mill 



Db 1512 65 CACACAACCCTTAGGCTTAAAAATGAGCTCACTCGGAATTTCTATT TAAGAG 

151214 

Qy 1910 ATTTATTTTTAAATCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGA 1969 

IIIIIIMMIIIIIIIII II lllllll IMMIIII lllllllllllllllllll 
Db 151213 ATTTATTTTTAAATCAATGTGAATCTGATACAAAGGAAGAGTAAGTCACTGTAAAACAGA 

151154 

Qy 1970 ACTTTTAAATGAAGCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTT 2 02 9 

llllllllllllllllllllllll 1 1 1 1 1 1 lllllllllllllllllllll llllll 

Db 151153 ACTTTTAAATGAAGCTTAAATTACCCAATTTGAAATTTTAAAATCCTTTAAAAGAACTTT 

151094 

Qy 2 030 TCAATTAATATTATCACACT - ATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTT 2088 

I I II I llllll! I llllll llllllllllllllllllllllll 1 1 II 

Db 151093 TTAATTAATATTTTCACACTGCTGATCAGACTGTAATTAGATGCAAATGAGAGAGTAGTT 

151034 

Qy 208 9 TAGTTGTTG CATTTTT CGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAA 2148 

llllll II llllll lllllll Mill llllllll IMIIIII, llllll 1 1 M 

Db 151033 TAGTTGCTGTATTTTTTGGACACTAGAAACATTTAAATGATCAGGAGGGAGTAACTGAAA 

150974 

Qy 214 9 GAGCAAGGCTGTTTTTGAAAATCATTACA CTTTCACTAGAAGCCCAAACCTCAGCAT 2205 

M IIIIIIIIIIIIIIIIIIIIIMIII MIMIIIIIIIIIIIIIIIIIIIIIII 
Db 150973 GAACAAGGCTGTTTTTGAAAATCATTACACTCCTTTCACTAGAAGCCCAAACCTCAGCAT 

150914 

Qy 22 06 TCTGCAATATGTAACCAACATGTCACAAACAAGCAGCATGTAAC^ 2265 

I I I I 1 I I I I I I I 1 t I I I ! I M I I I .1 ; II lllllllllllll 

Db 150913 TCTGCAATATGTAACCAACATGTTACAAACAAGCAGCATGTAACAAACTGGCA 

150854 

Qy 2266 CAGCTGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGT 2325 

1 1 1 1 III M III II M llllll III III I. M IIIIIMIIIIII 

Db 150853 CAGCCAAATCTAAAATATAATACTTTTAAAAAGAAAATTATTACACCCTTTACATTCAGA 

150794 

Qy 232 6 TAAGATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTT 2385 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 i II 1 1 1 1 II 

Db 150793 TAAGATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTGCCCCAAAAGACTTCTT 

150734 

Qy 2386 TGAATCTGTCATTCACATACCCTGTGAAGAC^TACTATCTACAATTTTTTCAGGATTAT 244 5 

llllllll llllllll I IMIIMIII IIIIIIIIIIMII :l I , I: 

Db 150733 TGAATCTGCCATTCACACAGCCTGTGAAGAAAATACTATCTACAAATTTTTCAGGATTAT 

150674 

Qy 2446 TAAAATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAAT 2505 

Mllllllllll I'M M II U II U' II ll'll llllll 

Db 150673 TAAAATCTTCTTCTTTCACTATTGTAGCTTAAACTCTGTTTGGTTTTGTCATCCGTAAAT 

150614 

Qy 2506 ACTTACCTACATACACTG CATGTAGATGATTAAATGAGGG CAGG CCCTGTGCTCATAG CT 2565 

Mill INI lllllll IMMIIII lllllll llllll IMMIIII llllll I, 

Db 150613 ACTTAGCTACATACACTGCATGTAGACGATTAAACGAGGGCGGGCCCTGTGTTCATAGTT 

150554 



Qy 2566 TTACGATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGT 2625 

Ml! I' INI IMI I 1 I II II I MI'I II II 'I Ml 

Db 150553 TTACAATGGAGAGATG C CAGTGACCTCATAATAGAGACTGTGAACTG CCTGGTGCGATGT 

150494 

Qy 2626 CCACATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTC 2685 

IIIIMMIII IIIIMIIIIIIIIIIIIMIIIIIIIMIIII IIIIMIIIIIMI 

Db 15 04 93 CCACATGACAAGGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTAGTTAAAATGGTTTC 

150434 

Qy 2686 TAGCATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACA 2745 

I II M' M II II III Ml 1 1 1 1 llllllllllllllllllllll 

Db 150433 TAGCATATGTATAATG CTGTAGTTAAAACACTGTTTTG CAAAATCATACAGATTAGTACA 

150374 

Qy 2746 TTTAACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAAT 2805 

Mill lllllllllllllllllllllll h I! I Ml III I; Hill II 

Db 150373 TTTAATGG CTACCTGTAAAG CTTATTACTAGTTTTTGTATTATTTTTGTAAATAG CCAAT 

150314 

Qy 28 06 AGAAAAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGA 2865 

I I ' ! I I I 1 1 i IIIIIIIIIIIIIIIIIIMIMI 

Db 150313 AGAAAAGTGTGCTTGACGTGGTGCTTTTCTTTCACTTAGAGGCAAAACTGCTTTTTGAGA 

150254 

Qy 2866 CCGTAAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGT 2 925 

I lllllllllllllllllllllllllllllllllllllllll llllllllllllllll 

Db 150253 CTGTAAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTAAATCTTCTAAGCAAAGT 

150194 

Qy 2 926 GCCTTAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAG 2 985 

IMIIM IIIMIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIMIIIIMIII 

Db 15 0193 GCCTTAGAATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAG 

150134 

Qy 2 986 AGAGGAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGCCTAA 3 04 5 

IIIIIIIIIIIIIIIIMI llllllll MIMMIMIIIIIIIMIIIIIIIIIII 

Db 150133 AGAGGAAATGAGGTGGGGTGAGAGGAAACTCATGGGGACAGATTCCCATTCTTAGCCTAA 

150074 

Qy 3046 CGTTCGTCATTGCCTCGTCACATC^TGCAAAAGGTCCTGATTTTGTTCCAGCAAAACAC 3105 

MMI I I IIIMM I M II II M M: HIM M I II MM 

Db 15 0073 CGTTCGTCATTGCCTCGTC^C^TC^TGCAAAAGGTCCTGATTTTGTTCC^GCAAAACAC 

150014 

Qy 3106 AGTGCAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTC 3165 

II I M MMIIIMM I ; I : I I , I I i I ] I ! 1 1 1 

Db 150013 AGTG CAATGTTCT CAGAGTGACTTTAGAAATAAATTGGG CC CAAGAGCTTTAACTCGGTC 

149954 

Qy 3166 TTAAAATATGCCCAAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGA 3225 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 14 9953 TTAAAATATGCCCAAATTTTTACTTTTTTTTTCTTTTAGTAAACTGGGCCACATGTTGGA 

149894 



Qy 



3226 AATAAGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAAC 3285 



IIIIIMIIIIIIIIIMIIIIMIIIIIIII IIIIIMIIMI IIIIIMMIIIII 

Db 14 98 93 AATAAGCTAGTAATGTTGTTTTCTGTCAATATCGAATGTGATGGTGCAGTAAACCAAAAC 

149834 

Qy 3286 CCAACAATGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCT 334 5 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII IIIIMIIIII 1 1 1 1 1 1 1 1 1 1 1 1 

Db 14 9833 CCAACAATGTGG CCAGAAAGAAAGAG CAATAATGATTAATTCACATGCCATGTGGATTCT 

149774 

Qy 3346 ATTTATAAATCACCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCC 34 05 

MINI 1 II Ml II II I I Mill II II IM II II 

Db 14 9773 ATTTATAAATCACCCACAAACTTGTTTTTTAATTTCATCCCAATCATTTTTTCAGAGGCC 

149714 

Qy 34 06 TGTTATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATT 3465 

MUM II Ml IIIIMIIIII I Mill I IIIIMIIMIIIIIII 

Db 14 9713 TGTTATCATAGAAGACATTTTAGACTTGCAATTTTAAATTAACTTTGAATCACTAATATT 

149654 

Qy 34 66 TTCACAGTTTATTAATATA - TTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCAT 3524 

I MUM IM II Ml IMIIIMI II II Ml M Ml III 

Db 14 9653 TTCACAGTTTATTAATATATTTTTATTTCTATTTAAATTTTAGATTATTTTTATTACCAT 

149594 

Qy 352 5 GTACTGAATTTTTACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAA 3584 

I II IM I MM II II Ml II II MM I Ml Ml III 

Db 14 9593 GTACTGAATTTTTATATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTGTAA 

149534 

Qy 3585 TTATCTTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAA 3644 

II II I 1 1 1 1 1 1 II II IMMI MM I MM II 

Db 149533 TTATCTTACCAAATTTTGAAACTG CACACAAAAAGCATACTTG CATTATTTATAATAAAA 

149474 

Qy 3 645 TTGCATTCAGTGGCTTTTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGT 3704 

IMMI II III Ml MUM II llllliill II Ml III II M 

Db 14 9473 TTGCATTCAGTGGCTTTTT - AAAAAAATGTTTGATTCAAAATTTTAACATACTGATAAGT 

149415 

Qy 3705 AAGAAACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTT 3764 

I IM I M I II MIMM I II Ml I M Ml || 

Db 14 9414 AAGAAACAATAATAATTTCTTTACATACTCAAAACCAAGATAGAAAAA.GGTGCTATTATT 

149355 

Qy 3765 CAACTTCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATT 3824 

II Ml M Ml Ml I MM I li MUM II II; II! Ill I 

Db 14 9354 TAACTTCAAAACATGTTTCCTAGTATTAAGAACTTTAATATAGCAACAGACAAAATTATT 

149295 

Qy 3 825 GTTAACATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTT 3884 

llllliill IMMI IM I II Ml MMI IMM MM M I 

Db 14 92 94 GTTAACATGAATGTTACAGCTCAGAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTT 

149235 

Qy 3885 ATTATCCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATA 3944 

IMMIIIIIIIMIIIIIIil IIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIII 



Db 14 9234 ATTATCCACTGCTAATGTGGATATATGTTCAAACACCTTTTAGTATTGATAGCTTACATA 

149175 



Qy 



Db 

149115 



394 5 TGGC CAAAGGAATACAGTTTATAGCAAAACATGGGTATG CTGTAG CTAACTTTATAAAAG 4 004 

I II I II II MMI 111,1 I 'I I I 

14 9174 TGGCCAAAGGAATACAGTTTATAGTGAAACATGGGTATACTGTAGCTAACTTTATAAAAC 



Qy 

Db 

149055 



4 005 TGTAATATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGG 4 064 

II II I ll;l! Illlll I Mill III I Mill I I. I 

14 9114 TGTAATATAACAATGTAAAAAATTATATACCTGGGGGGATTTTTTGGTTGCTTAAAGTGG 



Qy 

Db 

148995 



4 065 CTATAGTTACTGA - TTTTTTATTATGTAAGCAAAACCAATAAA AATTTAAGTTTTT 4119 

lllllll Mill 1 1 1 M M 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I || MUM 

14 9054 CTATAGTCACTGATTTTTTTATTATGTAAGCAAAACCAATAAACTTTAGGTTGTGTTTTT 



Qy 



Db 



412 0 TTAACAACTACCTTATTTTTCACTGTACAGACACTAATTCATTAAATACTAATTGATTGT 4179 

llllllllll lllllllllll lllllll I I i ; I I I i I 'II I II 1 1 ! I Ml 

148994 TTAACAACTAGCTTATTTTTCATTGTACAGGCACTAATTCATTAAATACTAATTGACTGT 



148935 



Qy 



Db 

148875 



4180 TTAAAAGAAATATAAATGTGACAAGTGGACATTATTTATGTTAAATATACAATTATCAAG 4239 

Mill IM M II IM II II I IIMI I! II I 'II Illlll 

14 8934 TTAAAGGAAATATAAATGTGACAAGTGGACACTATTTATGTTAAATATACAATCATCAAG 



Qy 424 0 CAAGTATGAAGTTATTCAATTAAAATGCCACATTTCTGGTCTCTGG 4285 

II ilMM; IIMM II II I II MUM IM I 

Db 14 8874 GAAGTATGAAGTTATTCAATTAAAATG C CACATTTCTGGTCTCTGG 148829 



RESULT 13
AR165435
LOCUS AR165435 1873 bp DNA linear PAT 17-OCT-2001
DEFINITION Sequence 13 from patent US 6280931.
ACCESSION AR165435
VERSION AR165435.1 GI:16240327
KEYWORDS
SOURCE Unknown.
ORGANISM Unknown.
Unclassified.
REFERENCE 1 (bases 1 to 1873)
AUTHORS Sakamoto,A. and Hanaoka,F.
TITLE Method for specifically amplifying a dystroglycan,
.alpha.-sarcoglycan, or endothelin B receptor cDNA of an extremely
small
JOURNAL Patent: US 6280931-A 13 28-AUG-2001;
FEATURES Location/Qualifiers
source 1..1873
/organism="unknown"
BASE COUNT 490 a 434 c 438 g 511 t
ORIGIN



Query Match 39.3%; Score 1691.8; DB 6; Length 1873;
Best Local Similarity 99.6%; Pred. No. 6.1e-293;
Matches 1696; Conservative 0; Mismatches 7; Indels 0; Gaps 0;



Qy    178 TGAAACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 237
          ||   || ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    171 TGTCTCTAGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 230

Qy    238 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 297
          ||||||||||||||||||||||||||||  ||||||||||||||||||||||||||||||
Db    231 ATGCAGCCGCCTCCAAGTCTGTGCGGACCGGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 290

Qy    298 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 357
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    291 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 350

Qy    358 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 417
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    351 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 410

Qy    418 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 477
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    411 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 470

Qy    478 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 537
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    471 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 530

Qy    538 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 597
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    531 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 590

Qy    598 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 657
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    591 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 650

Qy    658 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 717
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    651 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 710

Qy    718 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 777
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    711 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 770

Qy    778 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 837
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    771 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 830

Qy    838 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 897
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    831 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 890

Qy    898 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 957
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    891 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 950

Qy    958 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1017
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    951 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1010

Qy   1018 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTC 1077
          |||||||||||||||||||||||||||||||||||||||||||||||||| |||||||||
Db   1011 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTATTCAGTTTC 1070

Qy   1078 TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1137
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1071 TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1130

Qy   1138 TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1197
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1131 TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1190

Qy   1198 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1257
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1191 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1250

Qy   1258 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1317
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1251 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1310

Qy   1318 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1377
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1311 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1370

Qy   1378 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1437
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1371 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1430

Qy   1438 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1497
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1431 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1490

Qy   1498 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1557
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1491 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1550

Qy   1558 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1617
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1551 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1610

Qy   1618 TAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACAC 1677
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1611 TAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACAC 1670

Qy   1678 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1737
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1671 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1730

Qy   1738 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1797
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1731 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1790

Qy   1798 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1857
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1791 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1850

Qy   1858 ACTTAGGCTTAAAAATGAGCTCA 1880
          |||||||||||||||||||||||
Db   1851 ACTTAGGCTTAAAAATGAGCTCA 1873



RESULT 14
E15242
LOCUS E15242 1873 bp DNA linear PAT 28-JUL-1999
DEFINITION Human mRNA for endothelin B receptor, complete cds.
ACCESSION E15242
VERSION E15242.1 GI:5709925
KEYWORDS JP 1998057064-A/13.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1873)
AUTHORS Sakamoto,E. and Hanaoka,F.
TITLE SPECIFIC AMPLIFICATION OF MINOR GENE PRODUCT
JOURNAL Patent: JP 1998057064-A 13 03-MAR-1998;
RIKAGAKU KENKYUSHO
COMMENT OS Homo sapiens (human)
PN JP 1998057064-A/13
PD 03-MAR-1998
PF 16-AUG-1996 JP 1996216506
PI SAKAMOTO EIJI, HANAOKA FUMIO
PC C12N15/09,C07H21/02,C07H21/04//C12Q1/68;
CC strandedness: Double;
CC topology: Linear;
FH Key Location/Qualifiers
FH
FT source 1..1873
FT /organism='Homo sapiens'
FT /tissue_type='peripheral blood'
FT CDS 231..1559
FT /product='endothelin B receptor'
FEATURES Location/Qualifiers
source 1..1873
/organism="Homo sapiens"
/mol_type="genomic DNA"
/db_xref="taxon:9606"
BASE COUNT 490 a 434 c 438 g 511 t
ORIGIN



Query Match 39.3%; Score 1691.8; DB 6; Length 1873;
Best Local Similarity 99.6%; Pred. No. 6.1e-293;
Matches 1696; Conservative 0; Mismatches 7; Indels 0; Gaps 0;



Qy    178 TGAAACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 237
          ||   || ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    171 TGTCTCTAGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 230

Qy    238 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 297
          ||||||||||||||||||||||||||||  ||||||||||||||||||||||||||||||
Db    231 ATGCAGCCGCCTCCAAGTCTGTGCGGACCGGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 290

Qy    298 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 357
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    291 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 350

Qy    358 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 417
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    351 CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 410

Qy    418 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 477
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    411 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 470

Qy    478 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 537
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    471 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 530

Qy    538 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 597
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    531 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 590

Qy    598 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 657
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    591 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 650

Qy    658 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 717
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    651 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 710

Qy    718 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 777
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    711 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 770

Qy    778 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 837
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    771 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 830

Qy    838 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 897
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    831 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 890

Qy    898 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 957
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    891 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 950

Qy    958 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1017
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    951 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1010

Qy   1018 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTC 1077
          |||||||||||||||||||||||||||||||||||||||||||||||||| |||||||||
Db   1011 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTATTCAGTTTC 1070

Qy   1078 TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1137
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1071 TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1130

Qy   1138 TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1197
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1131 TTGAGAAAGAAAAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAA 1190

Qy   1198 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1257
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1191 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1250

Qy   1258 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1317
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1251 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1310

Qy   1318 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1377
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1311 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1370

Qy   1378 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1437
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1371 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1430

Qy   1438 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1497
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1431 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1490

Qy   1498 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1557
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1491 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1550

Qy   1558 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1617
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1551 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1610

Qy   1618 TAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACAC 1677
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1611 TAAAACAAAATGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACAC 1670

Qy   1678 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1737
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1671 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1730

Qy   1738 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1797
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1731 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1790

Qy   1798 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1857
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1791 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1850

Qy   1858 ACTTAGGCTTAAAAATGAGCTCA 1880
          |||||||||||||||||||||||
Db   1851 ACTTAGGCTTAAAAATGAGCTCA 1873



RESULT 15
S44866
LOCUS S44866 1872 bp mRNA linear PRI 07-MAY-1993
DEFINITION ETB endothelin receptor [human, mRNA, 1872 nt].
ACCESSION S44866
VERSION S44866.1 GI:233233
KEYWORDS
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1872)
AUTHORS Sakamoto,A., Yanagisawa,M., Sakurai,T., Takuwa,Y., Yanagisawa,H.
and Masaki,T.
TITLE Cloning and functional expression of human cDNA for the ETB
endothelin receptor
JOURNAL Biochem. Biophys. Res. Commun. 178 (2), 656-663 (1991)
MEDLINE 91315496
PUBMED 1713452
REMARK GenBank staff at the National Library of Medicine created this
entry [NCBI gibbsq 44866] from the original journal article.
This sequence comes from Fig. 1.
FEATURES Location/Qualifiers
source 1..1872
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
gene 1..1872
/gene="ETB endothelin receptor"
CDS 231..1559
/gene="ETB endothelin receptor"
/note="This sequence comes from Fig. 1"
/codon_start=1
/product="ETB endothelin receptor"
/protein_id="AAB19411.1"
/db_xref="GI:233234"
/translation="MQPPPSLCGPALVALVLACGLSRIWGEERGFPPDRATPLLQTAE
IMTPPTKTLWPKGSNASLARSLAPAEVPKGDRTAGSPPRTISPPPCQGPIEIKETFKY
INTVVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIVIDIPINVY
KLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWT
AVEIVLIWVVSVVLAVPEAIGFDIITMDYKGSYLRICLLHPVQKTAFMQFYKTAKDWW
LFSFYFCLPLAITAFFYTLMTCEMLRKKSGMQIALNDHLKQRREVAKTVFCLVLVFAL
CWLPLHLSRILKLTLYNQNDPNRCELLSFLLVLDYIGINMASLNSCINPIALYLVSKR
FKNCFKSCLCCWCQSFEEKQSLEEKQSCLKFKANDHGYDNFRSSNKYSSS"
BASE COUNT 489 a 434 c 438 g 511 t
ORIGIN



Query Match 39.3%; 
Best Local Similarity 99.6%; 
Matches 1695; Conservative 



Score 1690.8; DB 9; Length 1872; 
Pred. No. 9.3e-293; 
0; Mismatches 7; Indels 



Qy 

Db 



0 ; Gaps 



178 



0; 



237 



TGAAACTTGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 

M M IIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIMI 

171 TGTCTCTAGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGC 230 



Qy 
Db 

Qy 

Db 



238 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCX3CCCTGGTTGCGCTGGTTCTTGCCTGCGGC 2 97 

IIIIMIMIIIIIIIIIIIIIIIMII III M M II! I I II M l 

231 ATGCAGCCGCCTCCAAGTCTGTGCGGACCGGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 2 90 
298 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 357 

IMIIMI Ml Ml I Ml M MMM MM, IM M I MM MM 

291 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 350 



Qy 


358 


Db 


351 


Qy 


418 


Db 


411 


Qy 


478 


Db 


471 


Qy 


538 


Db 


531 


Qy 


598 


Db 


591 


Qy 


658 


Db 


651 


Qy 


718 


Db 


711 


Qy 


778 


Db 


771 


Qy 


838 


Db 


831 


Qy 


898 


Db 


891 


Qy 


958 


Db 


951 


Qy 


1018 


Db 


1011 




1 07ft 


Db 


1071 


Qy ' 


1138 


Db 


1131 



CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 417 

IMIIIIIMIIIMMIIIIIIIIIIIIIIMMIIIIMIIIIIIIIIIIIIIMIII 

CAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCC 410 



IMIMIIII I I1IIIIMIIIIIIIIIIMIIII IIIMMIIIMMI 



CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 537 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIII 

CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 530 
AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 597 

M mmi I im ii M ;i ir ii II II II ii :n 

AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 590 
ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 657 

IIIIIIIIIIMIMIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIII! 

ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 650 
GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 717 

IIIIMIIMIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIM 

GCC^GCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 710 
AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 777 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIMIMI 

AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 770 
CAGAAAG CCTCCGTGGGAATCACTGTGCTGAGT CTATGTG CT CTGAGTATTGACAGATAT 837 

MIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIMIIIIIIIIMIII 

CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 830 
CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 8 97 

IIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 8 90 
GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 957 

Ml Ml II I M' M II II Ih II ll III II MMMI 

GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 950 
GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1017 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 

GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1010 
CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGC^AAAGATTGGTGGCTGTTCAGTTTC 1077 

I I lllllllll 

CAGAAGACAGCTTTCATGC^GTTTTAC^GACAGCAAAAGATTGGTGGCTATTCAGTTTC 1070 
TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 1137 

I 1 II M III I .11 M 'i; II MM h ,1 II. II ! 

TATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATG 113 0 
TTGAGAAAGAAAAGTGGCATG CAGATTG CTTTAAATGATCACCTAAAGCAGAGACGGGAA 1197 

l ! II I! :ih i II ,1: ii, II II, II I II II ;ll 

TTGAGAAAGAAAAGTGG CATG CAGATTGCTTTAAATGATCACCTAAAG CAGAGACGGGAA 1190 



Qy 1198 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1257
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1191 GTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCAC 1250

Qy 1258 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1317
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1251 CTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTT 1310

Qy 1318 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1377
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1311 TTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGC 1370

Qy 1378 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1437
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1371 ATTAACCCAATTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGC 1430

Qy 1438 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1497
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1431 TTATGCTGCTGGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGC 1490

Qy 1498 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1557
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1491 TTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGC 1550

Qy 1558 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1617
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1551 TCATCTTGAAAGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCAT 1610

Qy 1618 TAAAACAAAATGAAACATTTGCCAAAACAAAAC 1677
        |||||||||||||||||||||||||||||
Db 1611 TAAAACAAAATGAAACATTTGCCAAAACA^ 1670

Qy 1678 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1737
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1671 TATTAAAATATTAAGTGTAATTATTTTAACACTCACAGCTACATATGACATTTTATGAGC 1730

Qy 1738 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1797
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1731 TGTTTACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTT 1790

Qy 1798 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1857
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1791 AATTTTTTACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTCACACAAC 1850

Qy 1858 ACTTAGGCTTAAAAATGAGCTC 1879
        ||||||||||||||||||||||
Db 1851 ACTTAGGCTTAAAAATGAGCTC 1872





Search completed: December 12, 2003, 19:39:19 
Job time : 15106.3 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         December 12, 2003, 09:17:01 ; Search time 7748.55 Seconds
                (without alignments)
                13490.718 Million cell updates/sec

Title:          US-09-931-157-2
Perfect score:  4301
Sequence:       1 gagacattccggtgggggac ctgggaaaaaaaaaaaaaaa 4301

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       22781392 seqs, 12152238056 residues

Total number of hits satisfying chosen parameters:     45562784

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      EST:*
                 1:  em_estba:*
                 2:  em_esthum:*
                 3:  em_estin:*
                 4:  em_estmu:*
                 5:  em_estov:*
                 6:  em_estpl:*
                 7:  em_estro:*
                 8:  em_htc:*
                 9:  gb_est1:*
                10:  gb_est2:*
                11:  gb_htc:*
                12:  gb_est3:*
                13:  gb_est4:*
                14:  gb_est5:*
                15:  em_estfun:*
                16:  em_estom:*
                17:  em_gss_hum:*
                18:  em_gss_inv:*
                19:  em_gss_pln:*
                20:  em_gss_vrt:*
                21:  em_gss_fun:*
                22:  em_gss_mam:*
                23:  em_gss_mus:*
                24:  em_gss_pro:*
                25:  em_gss_rod:*
                26:  em_gss_phg:*
                27:  em_gss_vrl:*
                28:  gb_gss1:*
                29:  gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
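
The figures in the run header above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the GenCore output. It assumes that both strands of every database sequence are searched (suggested by the hit count being exactly twice the sequence count) and that one "cell update" is one query residue compared against one database residue; the variable names are hypothetical.

```python
# Figures copied from the run header of the US-09-931-157-2 search.
query_length = 4301              # query runs from 1 to 4301
db_sequences = 22_781_392
db_residues = 12_152_238_056
search_seconds = 7748.55

# Assumption: each database sequence is searched on both strands,
# which doubles both the hit count and the amount of work.
strands = 2

total_hits = db_sequences * strands
cell_updates = query_length * db_residues * strands
million_cell_updates_per_sec = cell_updates / search_seconds / 1e6

print("hits:", total_hits)
print("million cell updates/sec:", round(million_cell_updates_per_sec, 1))
```

Under these assumptions the hit count reproduces the 45562784 reported in the header, and the throughput agrees with the reported 13490.718 to within the rounding of the search time.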

SUMMARIES 
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22 
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.4 
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32 


705   16.4
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c 


33 
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.3 
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c 


34   698.8   16.2   726   9
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AI422064 tf57cl2.x 
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35 
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16. 


,2 
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,2 
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,8 


771 
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,6 
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,4 
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43 
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3 
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9 
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44 
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2 
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9 
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2 
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RESULT 1 
AK083415
LOCUS       AK083415                3878 bp    mRNA    linear   HTC 05-DEC-2002
DEFINITION  Mus musculus 9 days embryo whole body cDNA, RIKEN full-length
            enriched library, clone:D030003K13 product:ENDOTHELIN B RECEPTOR
            PRECURSOR, full insert sequence.
ACCESSION   AK083415
VERSION     AK083415.1  GI:26350536
KEYWORDS    HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
  MEDLINE   99279253
   PUBMED   10349636
REFERENCE   2
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new genes
  JOURNAL   Genome Res. 10 (10), 1617-1630 (2000)
  MEDLINE   20499374
   PUBMED   11042159
REFERENCE   3
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P.,
            Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M.,
            Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A.,
            Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K.,
            Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M.,
            Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J.,
            Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.
  TITLE     RIKEN integrated sequence analysis (RISA) system--384-format
            sequencing pipeline with 384 multicapillary sequencer
  JOURNAL   Genome Res. 10 (11), 1757-1771 (2000)
  MEDLINE   20530913
   PUBMED   11076861
REFERENCE   4
  AUTHORS   Kawai,J., Shinagawa,A., Shibata,K., Yoshino,M., Itoh,M., Ishii,Y.,
            Arakawa,T., Hara,A., Fukunishi,Y., Konno,H., Adachi,J., Fukuda,S.,
            Aizawa,K., Izawa,M., Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka,I.,
            Saito,T., Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito,R.,
            Kadota,K., Matsuda,H., Ashburner,M., Batalov,S., Casavant,T.,
            Fleischmann,W., Gaasterland,T., Gissi,C., King,B., Kochiwa,H.,
            Kuehl,P., Lewis,S., Matsuo,Y., Nikaido,I., Pesole,G.,
            Quackenbush,J., Schriml,L.M., Staubli,F., Suzuki,R., Tomita,M.,
            Wagner,L., Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H.,
            Baldarelli,R., Barsh,G., Blake,J., Boffelli,D., Bojunga,N.,
            Carninci,P., de Bonaldo,M.F., Brownstein,M.J., Bult,C.,
            Fletcher,C., Fujita,M., Gariboldi,M., Gustincich,S., Hill,D.,
            Hofmann,M., Hume,D.A., Kamiya,M., Lee,N.H., Lyons,P.,
            Marchionni,L., Mashima,J., Mazzarelli,J., Mombaerts,P., Nordone,P.,
            Ring,B., Ringwald,M., Rodriguez,I., Sakamoto,N., Sasaki,H.,
            Sato,K., Schonbach,C., Seya,T., Shibata,Y., Storch,K.F., Suzuki,H.,
            Toyo-oka,K., Wang,K.H., Weitz,C., Whittaker,C., Wilming,L.,
            Wynshaw-Boris,A., Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S.
            and Hayashizaki,Y.
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409 (6821), 685-690 (2001)
  MEDLINE   21085660
   PUBMED   11217851
REFERENCE   5
  AUTHORS   The FANTOM Consortium and the RIKEN Genome Exploration Research
            Group Phase I & II Team.
  TITLE     Analysis of the mouse transcriptome based on functional annotation
            of 60,770 full-length cDNAs
  JOURNAL   Nature 420, 563-573 (2002)
REFERENCE   6  (bases 1 to 3878)
  AUTHORS   Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P.,
            Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume,W.,
            Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T.,
            Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T.,
            Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M.,
            Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Murata,M.,
            Nakamura,M., Nishi,K., Nomura,K., Numazaki,R., Ohno,M., Ohsato,N.,
            Okazaki,Y., Saito,R., Saitoh,H., Sakai,C., Sakai,K., Sakazume,N.,
            Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T.,
            Sogabe,Y., Tagami,M., Tagawa,A., Takahashi,F., Takaku-Akahira,S.,
            Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A.,
            Muramatsu,M. and Hayashizaki,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-APR-2002) Yoshihide Hayashizaki, The Institute of
            Physical and Chemical Research (RIKEN), Laboratory for Genome
            Exploration Research Group, RIKEN Genomic Sciences Center (GSC),
            RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama,
            Kanagawa 230-0045, Japan (E-mail:genome-res@gsc.riken.go.jp,
            URL:http://genome.gsc.riken.go.jp/, Tel:81-45-503-9222,
            Fax:81-45-503-9216)
COMMENT     cDNA library was prepared and sequenced in Mouse Genome
            Encyclopedia Project of Genome Exploration Research Group in Riken
            Genomic Sciences Center and Genome Science Laboratory in RIKEN.
            Division of Experimental Animal Research in Riken contributed to
            prepare mouse tissues.
            Please visit our web site for further details.
            URL:http://genome.gsc.riken.go.jp/
            URL:http://fantom.gsc.riken.go.jp/.
FEATURES             Location/Qualifiers
     source          1..3878
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL/6J"
                     /db_xref="FANTOM_DB:D030003K13"
                     /db_xref="taxon:10090"
                     /clone="D030003K13"
                     /tissue_type="whole body"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /dev_stage="9 days embryo"
     CDS             109..1437
                     /note="unnamed protein product; ENDOTHELIN B RECEPTOR
                     PRECURSOR (SWISSPROT|P48302, evidence: FASTY, 100%ID,
                     100%length, match=1326)
                     putative"
                     /codon_start=1
                     /protein_id="BAC38908.1"
                     /db_xref="GI:26350537"
                     /translation="MQSPASRCGRALVALLLACGFLGVWGEKRGFPPAQATLSLLGTK
                     EVMTPPTKTSWTRGSNSSLMRSSAPAEVTKGGRGAGVPPRSFPPPCQRNIEISKTFKY
                     INTIVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIIIDIPINTY
                     KLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWT
                     AVEIVLIWVVSVVLAVPEAIGFDMITSDYKGKPLRVCMLNPFQKTAFMQFYKTAKDWW
                     LFSFYFCLPLAITAVFYTLMTCEMLRKKSGMQIALNDHLKQRREVAKTVFCLVLVFAL
                     CWLPLHLSRILKLTLYDQSNPHRCELLSFLLVLDYIGINMASLNSCINPIALYLVSKR
                     FKNCFKSCLCCWCQTFEEKQSLEEKQSCLKFKANDHGYDNFRSSNKYSSS"
     polyA_signal    3859..3864
                     /note="putative"
     polyA_site      3878
                     /note="putative"
BASE COUNT     1109 a    861 c    812 g   1096 t
ORIGIN

  Query Match             31.9%;  Score 1372.2;  DB 11;  Length 3878;
  Best Local Similarity   66.0%;  Pred. No. 2.3e-184;
  Matches 2665;  Conservative    0;  Mismatches 1043;  Indels  327;  Gaps   34;
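
The two percentages in the result summary above can be reproduced from the other printed figures. The sketch below is illustrative only and not part of the search output. It assumes that Query Match is the score as a fraction of the perfect score, and that Best Local Similarity is the number of matches divided by the number of aligned columns (matches + mismatches + indels); the report does not state these definitions, but they reproduce the printed values for this result.

```python
# Figures copied from the run header and from the RESULT 1 (AK083415) summary.
perfect_score = 4301
score = 1372.2
matches, mismatches, indels = 2665, 1043, 327

# Assumed definition: score as a percentage of the perfect score.
query_match = 100 * score / perfect_score

# Assumed definition: matches as a percentage of all aligned columns.
aligned_columns = matches + mismatches + indels
best_local_similarity = 100 * matches / aligned_columns

print(f"Query Match           {query_match:.1f}%")            # 31.9%
print(f"Best Local Similarity {best_local_similarity:.1f}%")  # 66.0%
```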



Qy 

Db 



193 AAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCCA 252 

1 1 1 1 II lllllll III III I I MINI I I I MINI II II II 

64 AAACAGCAGAGCGGCTACCAGACTCTCACAGGAGC 123 



Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 



2 53 AGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTGG 312 

II I I 1 1 1 1 MINI I II Mill III I I I II I Ml 

124 AGCCGGTGCGGACGCGCCTTGGTGGCGCTGCTGCTGGCCTGTGGCTTCTTGGGGGTATGG 183 



313 



184 



370 



GGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTC- - -CGCTTTTGCAAACCGCAGAG 

HUM I Mill Mill MM I Mill I I III I II MM 

GGAGAGAAAAGAGGATTCCCACCTGCCCAAGCCACGCTGTCACTTCTCGGGACTAAAGAG 



ATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGG 

M 1 1 1 1 M M 1 1 1 1 1 II I II 1 1 1 1 III III I HI II II Ml 

244 GTAATGACGCCACCCACTAAGACCTCCTGGACCAGAGGTTCCAACTCCAGTCTGATGCGT 



369 



243 



429 



303 



430 



304 



490 



TCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACC 4 8 9 

II I MIIIIIIMIIIII I lllllll III III III MUM I I 

TCCTCCGCACCTGCGGAGGTGACCAAAGGAGGGAGGGGGGCTGGAGTCCCGCCAAGATC - 362 



ATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAAC 

II MUM MINIMI II II lllllll lllllll IMIIIIIIMI 

363 - - CTTCCCTCCTCCGTGCCAACGAAATATTGAGATCAGCAAGACTTTTAAATACATCAAC 



549 



420 



550 ACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGA 609 

Ml lllllll Mill IMMIMIM II II M 1 1 1 1 1 II I II II I M II III 

421 ACGATTGTGTCGTGCCTCGTGTTCGTGCTAGGCATCATCGGGAACTCCACGCTGCTAAGA 4 80 
610 ATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCT 669 



II Ml U! 11:11 'II II I II II J Ml IIIMI Mill 

^TCATCTACAAGAACAAGTGCATGCGCAATGGTCCCAATATCTTGATCGCCAGTCTGGCT 540 



Db 


481 


ATCATCTACAAGAACAAGTGCATGCGCAATGGTCCCAATATCTTGATC 


540 


Qy 


670 


CTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCA 

IMIIIIIIII MINIMI MM Mill II II II IMMM MM Ml 

CTGGGAGACCTACTGCACATCATCATAGACATACCCATTAACACCTACAAGTTGCTCGCA 


729 


Db 


541 


600 


Qy 


730 


GAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 

II 1 1 1 1 1 II 1 II 1 1 II 1 II II 1 II 1 1 1 1 1 1 1 II II 1 II 1 1 1 MMIMIMI II II 
GAGGACTGG CCATTTGGAG CTGAGATGTGTAAG CTGGTGCCCTTCATACAGAAGG CTTCT 


789 


Db 


601 


660 


Qy 
Db 


790 
661 


GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 

IMIIIIIIII IIMMIMM II Mill III 111:1 llllh 1 

GTGGGAATCACAGTGCTGAGTCTTTGTGCTCTAAGTATTGACAGATATCGAGCTGTTGCT 


849 
720 


Qy 


850 


TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 

IIIIIMM IMMiMI 1 II 1 MIMI M IM 1 II M 

TCTTGGAGTCGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTA 


909 


Db 


721 


780 


Qy 

Db 


910 
781 


ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 

II II 1 1 M 1 1 1 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 II 1 1 MMMMMIMMIMM 1 

ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCCGAAGCCATAGGTTTTGATATGATTACG 


969 
840 


Qy 


970 


ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 

1 MM 1 II || | Mill Mill Mill IMMM Mill 

TCGGACTACAAAGGAAAGCCCCTAAGGGTCTGCATGCTTAATCCCTTTCAGAAAACAGCC 


1029 


Db 


841 


900 


Qy 


1030 


TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 

Ml IM 1 II 1 1 M MMM IMMM Ml 1 IMMIMI 

TTCATGCAGTTTTACAAGACAGCCAAAGATTGGTGGCTGTTCAGTTTCTACTTCTGCTTG 


1089 


Db 


901 


960 


Qy 


1090 


CCATTGGC CATCACTG CATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAA 

M 1 1,1: 1 IMIIIII II MIIIMI MIMI 1 M Mill 

CCGCTAGCCATCACTGCAGTCTTTTATACCCTGATGACCTGCGAAATGCTCAGGAAGAAG 


1149 


Db 


961 


1020 


Qy 


1150 


AGTGGCATG CAGATTG CTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACC 

II M II M IM II IMMIMI II II 1 1 II II 1 II 

AG CGGTATG CAGATTG CTTTGAATGATCACTTAAAGCAGAGACGAGAAGTGGC CAAGACA 


1209 


Db 


1021 


1080 


Qy 


1210 


GTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATT 

Mill IMMMIMI II Mill Mill MMMMMMMMIMM MM 

GTCTTCTGCCTGGTCCTCGTGTTTGCTCTCTGTTGGCTTCCCCTTCACCTCAGCCGGATC 


1269 


Db 


1081 


1140 


Qy 


1270 


CTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTG 

IMMMIMI II III 1 MM MM 1 M Mill III MIIIMI II 

CTGAAGCTCACCCTGTATGACCAGAGCAATCCACACAGGTGTGAGCTTCTGAGCTTTTTG 


1329 


Db 


1141 


1200 


Qy 


1330 


TTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATT 

1 1 1 1 1 MIIIMI 1 II 1 II 1 II II II 1 1 1 1 1 II 1 1 1 1 IMIIIII II Mill 

1 1 I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 It 1 II 1 II 1 I 

TTGGTTTTGGACTACATTGGTATCAACATGGCTTCTTTGAACTCCTGCATCAATCCAATC 


1389 


Db 


1201 


1260 


Qy 


1390 


GCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGG 

M M 1 II 1 II 1 1 II 1 1 1 1 1 II 1 1 1 II II 1 II 1 II 1 1 1 1 1 II 1 1 II 1 1 II IIIIIMM 

GCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGTTTGTGCTGCTGG 


1449 


Db 


1261 


1320 


Qy 


1450 


TGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAA 


1509 



Mill I Mill Mill IMIMIMIMM IMIIIII III I IMMIMI 



Db 



1321 TGCCAAACGTTTGAGGAAAAGCAGTCCTTGGAGGAGAAGCAGTCCTGCCTGAAGTTCAAA 1380 



Qy 1510 GCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAG 1569 

II II lllllllllllllllllllllll Mill Mil lllllll I 

Db 1381 GCCAACGATCACGGATATGACAACTTCCGGTCCAGCAATAAATACAGCTCGTCTTGAAG^ 144 0 

Qy 1570 AAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATG 1629 

I I I II I I II III I II I II Mill M MMMIMMIMI 

Db 1441 CAAGAACACTCGCCGAATCTCACTGTCCTCATTGTGGACAGATACCATTAAAACAAAATG 1500 

Qy 1630 AAACATTTGCCAAAACAAAACAAAAAACTATGTATTTGCACAGCACACTA 168 9 

MM ; Ml Mill MM I I III I I II 

Db 15 01 AAACCGTTGCCAAATCAAAATGGAAAAAACCATGCTAGCAGAAAGGTGTGCGCGCGTGTG 1560 

Qy 1690 AAGTGTAATTATTTTAACACTCACAG CTACATATGAC ATTTTATGAG CTGTTTAC 1744 

I III lllllll I I II II I I Mill 

Db 1561 AGAGGGATTATTTTTAACTGTTCTGACGCTCAACACCGGATATATTCACGGGCTGTTTAC 162 0 

Qy 1745 GGCATGGAAAGAAAATCAGTGGGAATTAAGAAAG C CTCGTCGTGAAAG CACTTAATTTTT 18 04 

I I II II Mill III MM I I I I II I 

Db 1621 AACCTAAGAAAGCTGTGGGAAGGAATGAAGCCCTCCTCCGTGGGGAAGCACTTAGATTCT 1680 

Qy 1805 TACAGTTAGCACTT(^CATAGCTCTTAACAACTTCCAGGATATTCACACAACACTTAGG 1864 

I III 1 1 1 1 1 1 1 1 1 II MINI! II I II 1 1 1 1 1 1 1 1 1 1 1 1 I 

Db 1 6 8 1 T - - AGTCAGCACTTCAGCAGAGCTCTTAAAAGCCCCTAGTGCGTTCACATGCCACTTACG 1738 

Qy 1865 CTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTAAATC 1924 

lllllll I MM III I I II II II I I 

Db 173 9 TTTAAAAA AACGAGAACTTCACTGAAGTTCTGTTCAGGAGTTTATTATCCAGT 1791 

Qy 1925 AATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATGAAGC 1984 

I II MM I III III II I II II MUM III II I 

Db 1792 CCTATGAATCTGGATTCAAGAAAGCAT - -GACATTGCAAAACAATTCTTAAAACGAAGTT 184 9 

Qy 1985 TTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATATTATC 2 044 

I II I I I II III II 1 1 1 1 I I I III II III 1 1 1 1 

Db 1850 TCAATTGCTTAATTTGAAACTTAAAAAAAAAAAAACTAATAAATTTTTATGCATACTATC 1909 

Qy 2 0 4 5 - - ACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAG CAGTTTAGTTGTTG CA - TT 2101 

II I I III MINIM II lllllll II I III I I MM I I II 

Db 1910 ATACCCACTAATCTGATTGTAACTATATGCAAAAGAAAAGGCAATATGGTTGGTAAACTT 1969 

Qy 2102 TTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGTT 2161 

Ml II II I MUM llllllllll I I Mill 

Db 1970 TTTTGGTCATTACCAACATTGAAATGATCAGAATTCGGGGGAAGAAA 2 016 

Qy 2162 TTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGC^TTCTGCAATATGTAACC 2221 

I I II 

Db 2017 AGACAGCC 2024 

Qy 2222 AAC^TGTCACAAACAAGCAGCATGTAAC^GACTGGC^CATGTGCCAGCTGAATTTAAAAT 2281 

I I II I I I MM I II II 

Db 2025 TGCGAATG CCACAGAGAAAACATGGGAAAG CGTG 2 058 

Qy 2282 ATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCAC 2341 

III III II MM II I I II 

Db 2 059 AGCTGCTATGCCTGAGACTTCTGAAATTCCCTCACACATACTCTGCAG 2106 



Qy 


2342 


AAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCAC 


2401 


Db 


2107 


Mill III III 1 1 1 II II III II 1 1 1 

AAAGACACAAA ACAGAACACTACCTATGATTTCTTTAAAGTTCTTTCAAAT 


2157 


Qy 


2402 


ATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTTT 

II 1 1 Illlllllll MM 1 II 1 

ATCCTTTCATGATTGAAGTTTAAATTCCATGTGTTCAACTTCATCA 


2461 


Db 


2158 


2203 


Qy 


2462 


CACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACAC 

1 III 1 II Ml III 1 III 

T CTGTAAATACTTAG CTATTAG CTATAAG CAC 


2521 


Db 


2204 


2235 


Qy 


2522 


TGCATGTAGATGATTAAATGA - - GGG CAGG CCCTGTG CTCATAGCTTTACGATGGAGAGA 

1 II Mill II 1 II 1 II 1 III 1 1 II lllllll II Mill 

TACACGTAGAGGACTTAACAAAGGG CAGGT CCCAG CGTTCGTAGCTTT CTGACAAAGAGA 


2579 


Db 


2236 


2295 


Qy 
Db 


2580 
2296 


TG CCAGTGAC CT CATAAT - - AAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAA 

IMIMI Ml 1 II 1 III Mil MM 1 M Mill! II MM 

TGCCAGTAACCCGGTTATAGACAGAATGTGAATTGCCCGGTGCAGTGTCCACATGGCAAA 


2637 
2355 


Qy 


2638 


GGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTAT 

1 Mill MM 1 II III 1 IIIM 1 llllllll Ml 1 

GAAG CAGGGAG CAT C - - CTTTCAGCCATGCTGTAGAGAAAATGGTCCACAGC AC 


2697 


Db 


2356 


2407 


Qy 


2698 


AATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGT-ACATTTAACAGCTA 

III MM lllllll 1 II II MM 1 1 MM II 1 1 Mill 

AATATGATAGCGAAAATACCGTGGTTTAACGCCATAGAAAATAGTCACTGTAACCAGCTC 


2756 


Db 


2408 


2467 


Qy 


2757 


CCTGTAAAGCTTATTACTAA - TTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTT 

II 1 II II III II Mill 1 Mill II II MM II 1 1 

TCTCGGAGGCATACTACCAACTTTTTATGTTATTCCTGAAAATAGCCAATAGAAAGGCGT 


2815 


Db 


2468 


2527 


Qy 


2816 


GCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAAC 

II MIMIMMIMI III II II II 1 1 MM 1 

TCTGGAC^TGGTGCTTTTTCTAAAACGTAGAAGCCAAACTGCTTCGGGGTCTGCAAGATC 


2875 


Db 


2528 


2587 


Qy 


2876 


CT CTTAG CTTTGTG CGTTCCTG CCTAATTTTTATAT CTT CTAAG CAAA 

III 1 Mill II Ml II III MM 1 1 II 1 II 

CTCCT - - CTTTGCGCATTCTTGTCTAGGTTTTTTTTTTTTTTTTTTAATCTCCTTCCACG 


2923 


Db 


2588 


2645 


Qy 


2924 


- - GTGCCTTAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGA 

1 II 1 1 II M 1 1 II lllllll lllllllllll Ml lllllllllll II 

ACGTGCCTTAGGTTCACTCCGGATGAGCGGTGTGTGAAAGAATGCCCAAGAGAAAACTGA 


2981 


Db 


2646 


2705 


Qy 


2982 


AGAGAGAGGAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGC 

1 M 1 II MM MUM III MUM 1 IIIIIIIIIIIIIII 

AGAGAGAGGAAATGAGGTGGGG CCAGAGGAAG CCCGTGGGGAAATATTC CCATTCTTAGC 


3041 


Db 


2706 


2765 


Qy 


3042 


CTAACGTTCGTC^TTGCCTCGTC^(^TCAATGC^W\AGGTCCTGATTTTGTTCCAGCAA^ 

1 MMMM MM IIIM II II Illlllllll II 1 lllllllll 

CCTGTGTTCGTCACTGCCACGTCATGTCGGTGTGAAAGGTCCTGGTTCGGCTCCAGCAAA 


3101 


Db 


2766 


2825 


Qy 


3102 


ACACAGTGCAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTC 

Ml II III llllllll Mill 1 1 II Ml MM Ml M 

AO^GCGC^GCGTTCT(^GCGTGAC-TCGGGAACAAACCAAGCCCGAGAGCTTTAACCT 


3161 


Db 


2826 


2884 






Qy 


3162 


Db 


2885 


Qy 


3186 


Db 


2945 


Qy 


3221 


Db 


3005 


Qy 


3281 


Db 


3065 


Qy 


3341 


Db 


3121 


Qy 


3401 


Db 


3162 


Qy 


3460 


Db 


3222 


Qy 


3520 


Db 


3281 


Qy 


3572 


Db 


3341 


Qy 


3632 


Db 


3401 


Qy 


3692 


Db 


3455 


Qy 


3752 


Db 


3515 


Qy 


3812 


Db 


3572 


Qy 


3872 


Db 


3632 


Qy 


3932 



llllllllllll I I III 



TACTTTGTTTTT CTTTTAATAGG CTGGG CCACATG 322 0 

I II I I IMIIII III I Illllllll 
TTCTCTTCTCTTCTCTTCTCTTCTCTTCTCTTCTCTTCTTTTCATAACCCAGG CCACATG 3004 

TTGGAAATAAGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACC 3280 

III I I I I I I I I I III I I II I IIMI I I I I Ml 

TTGAAAATGAGCTTAACAATGC^GTTTTCTACCAAAATC^TTGTGAC^TAa^TAAACC 3 064 

AAAA C C CAA CAATG TGG C CAG AAAGAAAG AG CAATAATAATTAATT CA CA CA C CAT ATGG 334 0 

1 1 1 1 llllll II I III 1 1 1 1 Mill I I I MM I 

CAAACGGGAC^TGAGGTAAAAAACCAAGAAC^TACTGAATCCACGTGACAC ATG 312 0 

ATTCTATTTATAAATCACCCACAAACTTGTTCT 34 00 

I III MM I 1 1 1 1 1 ! 1 1 1 I I I II I 

ACTCTCTTTAGGAGTCACCCACAGTTCTTGTGTGTA CAGAT 3161 

AGGCCTGTTATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATT - TTGAATCACT 34 59 

I I I llllll III III 11,111 III II III ll I 

TGCTTTTTAATCATAAAGGACGCCCCAGATCTTCAATTTTAAGTTAGTTATTGGCTCCCC 3221 
AATATTTTCACAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATT 3519 

I II MUM I MM II! || III Illllllll IMIIII 

AGTAGTTTCACAGCGTGGATATATTTTTAATTTTTA - CTAAGTTTTAGATTGGTTTTATT 3280 
ACCATGTACTGAATTTTTACATCCTGATACCCTTTCCTTCTCCATGT CAGTA 3571 

III II MM III MM I I I II I III I I 

GTTGTGTTCTAAATTCTTAAGTCCTAACATCTTTGTTTAACCCAGATGTTCCTTCCCTCT 334 0 
TCATGTTCTCTAATTATCTTGCCAAATT^ 3631 

Mill I MM II Illllllll Mill II I II I II I II 

TC^TGGGC^TAATCGTCCTGCC^AATTATGAAATGGCATAAGAATACTATTCACATAAT 34 00 
ATTTATAATAAAATTGCATTC^GTGGCTTTTTAAAAAAAATGTTTGATTCMAACTTTAA 3691 

II II llllll! I III lllllllllll I MM III II I 

ATATACAATAAAACTATATTAAGTGGCTTTTTTATTAAAAATTTTAGCACA CAG 3454 

CATACTGATAAGTAAGAAACAATTATAATTTCTTTACATACTCAAAACCAAGATAGAAAA 3751 

I I I I I Ml II II III I II MM I IIIIIIIMIII I I ' 

ACC^AGGGTGATAAGAAAAAAAACATGATTCCCTTGCATAATTAAAACCAAGATAAGAGA 3514 
AGGTGCTATCGTTCAACTTCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAAC 3811 

MM I III II II II MM I Mill I i 1 1 1 1 1 ' 1 1 

AGGTACCATCT AATTTAAAGCATATTTTCTAACATTTAAGTAGCCTAATATAGCAAT 3571 

AGACAAAATTATTGTTAACATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACC 3871 

I MM II 1 1 1 f 1 1 1 i i II Mill Mill Ml II II I II 

G CATAAAAATAGTGTTAACAAGGATGTTAGAGGTCAAACGATTTGTAAGTGACTTCAG CC 3631 
TATTTTCTCCCTTATTATCCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATT 3 931 

1,11111 Mill IMIIII I I I I I 1 1 1 1 1 1 i I J I III llllll 

TATTTTCTCCCAAATTATTTACTGCTATTTTTGGTCTGTGTTCAAACA- TTTTCAGTATT 3690 
3 932 GATAGCTTACATATGGCCAAAGGAATAC^GTTTATAGC^WyVCATGGGTATGCTGTAGCT 3991 



1 1 1 1 I II I I Ml. 1 ! 1 1 1 1 1 1 I I III 1 1 1 1 II 1 1 1 1 II 

Db 3691 GATAATGTGCA-ACAGCOW^GGAACACTGTTTTCATCCAAATGCGGGTGTGTTGTACCT 374 9 

Qy 3992 AACTTTATAAAAGTGTAATATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGG 4 051 

III II I III | | III || | | I I I III 

Db 3750 AAC ATGCACTTGTAATAAAGCCGTGTAAAA TAACTGTGTTTTGTTTTGCTCTGG 3803 

Qy 4 052 TTGCCTAAAGTGGC TATAGTTACTGATTTTTTATTATGTAAGCAAAACCAATAA 4105 

I II I II 1 1 II 1 1 I I III II I II II II lllllllllllllllll 

Db 3804 TCACCTAAAGTGGCAGCTTGTGTCGTTGCTAACTTCTTGTTGAGTAAGCAAAACCAATAA 3863 

Qy 4106 AAATTTAAGTTTTTT 412 0 

I Mill III 
Db 3864 ACGTTCAAATGGTTT 3878 
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Functional annotation of a full-length mouse cDNA collection 
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21085660 
11217851 
5 

The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
6 (bases 1 to 3990) 
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FEATURES 

URL:http://fantom.gsc.riken.go.jp/.
                     Location/Qualifiers
     source          1..3990
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL/6J"
                     /db_xref="FANTOM_DB:D630038G12"
                     /db_xref="taxon:10090"
                     /clone="D630038G12"
                     /tissue_type="kidney"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /dev_stage="0 day neonate"
     CDS             222..1550
                     /note="unnamed protein product; ENDOTHELIN B RECEPTOR
                     PRECURSOR (SWISSPROT|P48302, evidence: FASTY, 100%ID,
                     100%length, match=1326)
                     putative"
                     /codon_start=1
                     /protein_id="BAC39465.1"
                     /db_xref="GI:26351657"
                     /translation="MQSPASRCGRALVALLLACGFLGVWGEKRGFPPAQATLSLLGTK
                     EVMTPPTKTSWTRGSNSSLMRSSAPAEVTKGGRGAGVPPRSFPPPCQRNIEISKTFKY
                     INTIVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIIIDIPINTY
                     KLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWT
                     AVEIVLIWVVSVVLAVPEAIGFDMITSDYKGKPLRVCMLNPFQKTAFMQFYKTAKDWW
                     LFSFYFCLPLAITAVFYTLMTCEMLRKKSGMQIALNDHLKQRREVAKTVFCLVLVFAL
                     CWLPLHLSRILKLTLYDQSNPHRCELLSFLLVLDYIGINMASLNSCINPIALYLVSKR
                     FKNCFKSCLCCWCQTFEEKQSLEEKQSCLKFKANDHGYDNFRSSNKYSSS"
     polyA_signal    3972..3977
                     /note="putative"
     polyA_site      3990
                     /note="putative"
BASE COUNT     1136 a    885 c    850 g   1119 t
ORIGIN



  Query Match             31.9%;  Score 1371.2;  DB 11;  Length 3990;
  Best Local Similarity   66.0%;  Pred. No. 3.2e-184;
  Matches 2664;  Conservative    0;  Mismatches 1043;  Indels  327;  Gaps   34;



Qy 
Db 



193 AAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCCA 252 

MM M MMMI Ml Ml I ! MMM I I I MMM MM M 

177 AAACAG CAGAGCGG CTACCAGACT CTCACAGGAG CAAG CTGTAACATG CAATCG CCCGCA 236 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 



253 AGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTGG 

M I MMMMMMI MM MMM I M MMI Ml I I I M I Ml 

237 AGCCGGTGCGGACGCGCCTTGGTGGCGCTGCTGCTGGCCTGTGGCTTCTTGGGGGTATGG 



313 



297 



370 



GGAGAGGAGAGAGG CTT CCCG CCTGACAGGG CCACTC - - - CGCTTTTGCAAACCGCAGAG 

MMM I MMI Mill MM I Mill I I III I M MM 

GGAGAGAAAAGAGGATTCCCACCTGCCCAAGCCACGCTGTCACTTCTCGGGACTAAAGAG 



312 



296 



369 



356 



ATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGG 429 

MIMIIIIIIIIIIIIIIIIIII III III I'MI I'll Ml 

357 GTAATGACGCCACCCACTAAGACCTCCTGGACCAGAGGTTCCAACTCCAGTCTGATGCGT 416 



430 TCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACC 

II I 1 1 1 1 II M II I M 1 1 I MMMI Ml III III MMM I I 



489 



Db 



417 TCCTC CGCACCTG CGGAGGTGACCAAAGGAGGGAGGGGGG CTGGAGTCCCG CCAAGATC - 475 



Qy 


490 


Db 


476 


Qy 


550 


Db 


534 


Qy 


610 


Db 


594 


Qy 


670 


Db 


654 


Qy 


730 


Db 


714 


Qy 


790 


Db 


774 


Qy 


850 


Db 


834 


Qy 


910 


Db 


894 


Qy 


970 


Db 


954 


Qy 


1030 


Db 


1014 


Qy 


1090 


Db 


1074 


Qy 


1150 


Db 


1134 


Qy 


1210 


Db 


1194 


Qy 


1270 


Db 


1254 



ATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAAC 54 9 

II MINI lllllllll II II lllllll lllllll MINIMUM 

- - CTTCCCTCCTCCGTGCCAACGAAATATTGAGATCAGCAAGACTTTTAAATACATCAAC 533 
ACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGA 609 

III lllllll Mill lllllllllll II lllllllllllllllll II II III 

ACGATTGTGTCGTGCCTCGTGTTCGTGCTAGGCATCATCGGGAACTCCACGCTGCTAAGA 593 
ATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCT 669 

II M'll II II II I 'I M II 'II I; II I I Mill 

ATCATCTACAAGAACAAGTGCATGCGCAATGGTCCa\ATATCTTGATCGCCAGTCTGGCT 653 
CTGGGAGACCTGCTGCACATCGTCATTGAC^TCCCTATCAATGTCTACAAGCTGCTGGCA 729 

lllllllllll lllllllll MM Mill II II II lllllll MM III 

CTGGGAGACCTACTGCACATC^TCATAGACATACCC^TTAACACCTACAAGTTGCTCGCA 713 
GAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789 

! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMIIIII M II 

GAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCCTTCATACAGAAGGCTTCT 773 
GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 84 9 

lllllllllll MMMIIIII llllllll IIIIIIIIIIIIIIIMIIIIIIMII 

GTGGGAATCACAGTGCTGAGTCTTTGTGCTCTAAGTATTGACAGATATCGAGCTGTTGCT 833 
TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909 

lllllllll IUI IT I M III II Ml M II ll II II I 

TCTTGGAGTCGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTA 893 
ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969 

I II 1 1 1 II II 1 1 1 1 1 II II II I II II II M 1 1 : ■ I/ MMM 

ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCCGAAGCCATAGGTTTTGATATGATTACG 953 
ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 102 9 

llllllllllllll II I Mill Mill Mill lllllll Mill 

TCGGACTACAAAGGAAAGCCCCTAAGGGTCTGCATGCTTAATCCCTTTCAGAAAACAGCC 1013 



II I II II II 1 1 II I II II 1 1 II I llllllllllllllllllllllllll llllllll 



M I llllllllllll I llllllll II llllllll llllll I II Mill 



AGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACC 1209 

M M llllllllllllll lllllllll lllllllllllll lllllllllll II 

AGCGGTATGCAGATTGCTTTGAATGATCACTTAAAGCAGAGACGAGAAGTGGCCAAGACA 1193 



Mill lllllllllll II Mill Mill llllllllllllllllllll! MM 



CTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTG 132 9 

lllllllllll II III I MM MM I II Mill III llllllll II 

CTGAAGCTCACCCTGTATGACCAGAGCAATCCACACAGGTGTGAGCTTCTGAGCTTTTTG 1313 



Qy 1330 TTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATT 1389

Mill MINIM MIMIIIIIIMIIIIIII MM MMMM II Mill 

Db 1314 TTGGTTTTGGACTACATTGGTATCAACATGGCTTCTTTGAACTCCTGCATCAATCCAATC 1373 

Qy 1390 GCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGG 1449

I f i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 ! 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 II MMMMI 

Db 1374 GCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGTTTGTGCTGCTGG 1433 

Qy 1450 TGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAA 1509

Mill I Mill Mill MMMMMMM MMMM Ml I MMMMI 

Db 1434 TGCCAAACGTTTGAGGAAAAGCAGTCCTTGGAGGAGAAGCAGTCCTGCCTGAAGTTCAAA 1493

Qy 1510 GCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAG 1569 

ii i r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 iiiiiiiiiiiiii 1 1 1 1 1 1 1 i 

Db 1494 GCCAACGATCACGGATATGACAACTTCCGGTCCAGCAATAAATACAGCTCGTCTTGAAGG 1553

Qy 1570 AAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATG 1629 

I I I II I I II III I II I II Mill II MMMMMIMM 

Db 1554 CAAGAACACTCGCCGAATCTCACTGTCCTCATTGTGGACAGATACCATTAAAACAAAATG 1613 

Qy 1630 AAACATTTGCCAAAACAAAACAAAAAACTATGTAT^ 1689 

1 1 1 1 MINIM Mill MM I I III I I II 

Db 1614 AAACCGTTGCCAAATCAAAATGGAAAAAACCATGCTAGCAGAAAGGTGTGCGCGCGTGTG 1673 
Qy 1690 AAGTGTAATTATTTTAACACTCACAGCTACATATGAC ATTTTATGAGCTGTTTAC 1744 

I III M 'I I I II MM lllllllll 

Db 1674 AGAGGGATTATTTTTAACTGTTCTGACGCTCAACACCGGATATATTCACGGGCTGTTTAC 1733 

Qy 1745 GGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAATTTTT 1804

MM II Mill III MM I I lllllllll II I 

Db 1734 AACCTAAGAAAGCTGTGGGAAGGAATGAAGCCCTCCTCCGTGGGGAAGCACTTAGATTCT 1793

Qy 1805 TACAGTTAGCACTTCAACATAGCTCTTAACAACT^ 1864 

I Ml lllllllll II MMMMI I I I II MUM llllll I 

Db 1794 T - - AGTCAGCACTTCAGCAGAGCTCTTAAAAGCCCCTAGTGCGTTCACATGCCACTTACG 1851 

Qy 1865 CTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTAAATC 1924 

1 1 1 1 1 1 1 I MM III I I II II II I I 

Db 1852 TTTAAAAA AACGAGAACTTCACTGAAGTTCTGTTCAGGAGTTTATTATCCAGT 1904 

Qy 1925 AATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATGAAGC 1984 

I II MM I III III II I II II llllll III II I 

Db 1905 CCTATGAATCTGGATTCAAGAAAGCAT- -GACATTGCAAAACAATTCTTAAAACGAAGTT 1962 

Qy 1985 TTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATATTATC 2044

MM II II III II MM llllll II III MM 

Db 1963 TCAATTGCTTAATTTGAAACTTAAAAAAAAAAAAACTAATAAATTTTTATGCATACTATC 2022

Qy 2045 --ACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCA-TT 2101 

II I I Ml MMMM II lllllll II I III I I MM I I II 

Db 2023 ATACCCACTAATCTGATTGTAACTATATGCAAAAGAAAAGGCAATATGGTTGGTAAACTT 2082

Qy 2102 TTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGTT 2161 

Ml II II I llllll Mllllllll I I Mill 

Db 2083 TTTTGGTCATTACCAACATTGAAATGATCAGAATTCGGGGGAAGAAA 2129 



Qy 2162 TTTGAAAATCATTACACTTra 2221 

I I II 

Db 2130 AGACAGCC 2137 

Qy 2222 AACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAAAT 2281

I I II I I I I I I I I II II 
Db 2138 TGCGAATGC CACAGAGAAAACATGGGAAAG CGTG 2171 

Qy 2282 ATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCAC 2341 

III III II I I I I II I I II 

Db 2172 AGCTGCTATGCCTGAGACTTCTGAAATTCCCTCACACATACTCTGCAG 2219 

Qy 2342 AAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCAC 2401

Mill III III I I I II II III II I I I 

Db 2220 AAAGACACAAA ACAGAACACTACCTATGATTTCTTTAAAGTTCTTTCAAAT 2270

Qy 2402 ATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTTT 2461

II I I I II II I I II I MM I II I 

Db 2271 ATCCTTTCATGATTGAAGTTTAAATTCCATGTGTTCAACTTCATCA 2316 

Qy 2462 CACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACAC 2521

I III I II Ml III I III 

Db 2317 TCTGTAAATACTTAGCTATTAGCTATAAGCAC 2348

Qy 2522 TGCATGTAGATGATTAAATGA--GGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGA 2579

I II Mill II I II I II I III I I II IIIIIII II Mill 
Db 2349 TACACGTAGAGGACTTAACAAAGGGCAGGTCCCAGCGTTCGTAGCTTTCTGACAAAGAGA 2408

Qy 2580 TGCCAGTGACCTCATAAT--AAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAA 2637

MIMM Ml III I III MUM MM MMMIMIMIMM MM 

Db 2409 TGCCAGTAACCCGGTTATAGACAGAATGTGAATTGCCCGGTGCAGTGTCCACATGGCAAA 2468

Qy 2638 GGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGTAT 2697

I Mill MM I II III MIMIIII I llllllll III I 

Db 2469 GAAGCAGGGAGCATC--CTTTCAGCCATGCTGTAGAGAAAATGGTCCACAGC AC 2520

Qy 2698 AATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGT-ACATTTAACAGCTA 2756 

Ml MM MIMM I II II MM I I MM M I I Mill 

Db 2521 AATATGATAGCGAAAATACCGTGGTTTAACGCCATAGAAAATAGTCACTGTAACCAGCTC 2580 

Qy 2757 CCTGTAAAGCTTATTACTAA-TTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGTTT 2815

II I II II III II Mill I Mill II MMMIMMIMM I I 

Db 2581 TCTCGGAGGCATACTACCAACTTTTTATGTTATTCCTGAAAATAGCCAATAGAAAGGCGT 2640

Qy 2816 GCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGAAC 2875 

M MMIIIMIIMI in ii ii i i i i mm I 

Db 2641 TCTGGACATGGTGCTTTTTCTAAAACGTAGAAGCCAAACTGCTTCGGGGTCTGCAAGATC 2700 

Qy 2876 CTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAA 2923

2701 CTCCT CTTTGCGC^TTCT 2758 
Qy 2924 - - GTGCCTTAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGA 2981 

II II I I || MIMM lllllllllll III II. Ill I II 

Db 2759 ACGTGCCTTAGGTTCACTCCGGATGAGCGGTGTGTGAAAGAATGCCCAAGAGAAAACTGA 2818 



Qy 2982 AGAGAGAGGAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATTCCCATTCTTAGC 3041



I II MM II I IIIIII Ml IIIMI | IIIIIIIIIIIIIII 

Db 2819 AGAGAGAGGAAATGAGGTGGGGCCAGAGGAAGCCCGTGGGGAAATATTCCCATTCTTAGC 2878

Qy 3042 CTAACGTTCGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAA 3101 

I IMIIMI 1 1 1 1 Mill II II IMIIIIIII II I MINIMI 

Db 2879 CCTGTGTTCGTCACTGCCACGTCATGTCGGTGTGAAAGGTCCTGGTTCGGCTCCAGCAAA 2938 

Qy 3102 ACACAGTGCAATGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTC 3161 

III II III llllllll Mill I I II III MM lllllllllll 

Db 2939 ACAAAGCGCAGCGTTCTCAGCGTGAC-TCGGGAACAAACCAAGCCCGAGAGCTTTAACCT 2997
Qy 3162 GGTCTTAAAATATGCCCAAATTTT 3185 

lllllllllll I I Ml 

Db 2998 TGTCTTAAAATATAACAGATTTTCCTTCCTTCCTTTTTCTCTTTCTTCTCTTCTCTTCTC 3057
Qy 3186 TACTTTGTTTTTCTTTTAATAGGCTGGGCCACATG 3220

I II I I lllllll III I Illllllll 

Db 3058 TTCTCTTCTCTTCTCTTCTCTTCTCTTCTCTTCTCTTCTTTTCATAACCCAGGCCACATG 3117 

Qy 3221 TTGGAAATAAGCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACC 3280

Ml 1 1 1 1 MM I III I I I I I Mill MM IIIIII 

Db 3118 TTGAAAATGAGCTTAACAATGCAGTTTTCTACCAAAATCATTGTGACAATACAATAAACC 3177

Qy 3281 AAAACCCAACAATGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGG 3340

MM MIMI 1 1 I III MM MMI I I I MM I 

Db 3178 CAAACGGGACAATGAGGTAAAAAACCAAGAACAATACTGAATCCACGTGACAC ATG 3233 

Qy 3341 ATTCTATTTATAAATCACCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAG 3400

I III MM I Illllllll I I I II I 

Db 3234 ACTCTCTTTAGGAGTCACCCACAGTTCTTGTGTGTA CAGAT 3274 

Qy 3401 AGGCCTGTTATCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATT-TTGAATCACT 3459

I I I MMM Ml III I f 1 1 1 1 1 1 1 1 Ml II III II I 

Db 3275 TGCTTTTTAATCATAAAGGACGCCCCAGATCTTCAATTTTAAGTTAGTTATTGGCTCCCC 3334 

Qy 3460 AATATTTTCACAGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATT 3519

I M MIIIMI I MM llllllll II III Illllllll lllllll 

Db 3335 AGTAGTTTCACAGCGTGGATATATTTTTAATTTTTA-CTAAGTTTTAGATTGGTTTTATT 3393
Qy 3520 ACCATGTACTGAATTTTTACATCCTGATACCCTTTCCTTCTCCATGT CAGTA 3571

Ml II MM III MM I I I II I III I I 

Db 3394 GTTGTGTTCTAAATTCTTAAGTCCTAACATCTTTGTTTAACCCAGATGTTCCTTCCCTCT 3453

Qy 3572 TCATGTTCTCTAATTATCTTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCATT 3631 

Mill I MM II Illllllll Mill III II I || | || 

Db 3454 TCATGGGCAATAATCGTCCTGCCAAATTATGAAATGGCATAAGAATACTATTCACATAAT 3513

Qy 3632 ATTTATAATAAAATTGCATTCAGTGGCTTTTTAAAAAAAATGTTTGATTCAAAACTTTAA 3691 

II M lllllll I III lllllllllll I MM III II I 

Db 3514 ATATACAATAAAACTATATTAAGTGGCTTTTTTATTAAAAATTTTAGCACA CAG 3567 

Qy 3692 CATACTGATAAGTAAGAAACAATTATAATTTCTTTACATACTOW^CC^GATAGAAAA 3751 

I M I I III II II III I II MM I IMIMMMII I I 

Db 3568 ACCAAGGGTGATAAGAAAAAAAACATGATTCCCTTGCATAATTAAAACCAAGATAAGAGA 3627 

Qy 3752 AGGTGCTATCGTTCAACTTCAAAACATGTTTCCTAGTATTAAGGACTTTAATATAGCAAC 3811 

MM I III II II M MM I Mill I IMMIIMM 



Db 3628 AGGTACCATCT AATTTAAAG CATATTTTCTAACATTTAAGTAG CCTAATATAG CAAT 3684



Qy 3812 AGACAAAATTATTGTTAACATGGATGTTACAGCTCAAAAGATTTATAAAAGATTTTAACC 3871 

I 1 1 1 1 II Illlllll Illlllll II Mill Mill Ml M II I II 

Db 3685 GCATAAAAATAGTGTTAACAAGGATGTTAGAGGTCAAACGATTTGTAAGTGACTTCAGCC 3744 

Qy 3872 TATTTTCTCCCTTATTATCCACTGCTAATGTGGATGTATGTTCAAACACCTTTTAGTATT 3931

MIIIIIIMI Mill lllllll I I I I I IMMIIIII III llllll 

Db 3745 TATTTTCTCCCAAATTATTTACTGCTATTTTTGGTCTGTGTTCAAACA-TTTTCAGTATT 3803

Qy 3932 GATAGCTTACATATGGCCAAAGGAATACAGTTTATAGCAAAACATGGGTATGCTGTAGCT 3991 

MM I II I i II MM I I III MM II MM II 

Db 3804 GATAATGTGCA-ACAGCCAAAGGAACACTGTTTTCATCCAAATGCGGGTGTGTTGTACCT 3862 

Qy 3992 AACTTTATAAAAGTGTAATATAACAATGTAAAAAATTATATATCTGGGAGGATTTTTTGG 4051

III II I lllllll I I lllllll II I I I I I III 

Db 3863 AAC ATGCACTTGTAATAAAGCCGTGTAAAA TAACTGTGTTTTGTTTTGCTCTGG 3916

Qy 4052 TTGCCTAAAGTGGC TATAGTTACTGATTTTTTATTATGTAAGCAAAACCAATAA 4105

I MIIMIMM I I Ml M I II M II MIMMMMMMM 

Db 3917 TCACCTAAAGTGGCAGCTTGTGTCGTTGCTAACTTCTTGTTGAGTAAGCAAAACCAATAA 3976

Qy 4106 AAATTTAAGTTTTT 4119 

Illlllll 
Db 3977 ACGTTCAAATGGTT 3990
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AK082103 2521 bp mRNA linear HTC 05-DEC-2002 

Mus musculus 0 day neonate cerebellum cDNA, RIKEN full-length 
enriched library, clone:C230007M01 product:ENDOTHELIN B RECEPTOR
PRECURSOR, full insert sequence. 
AK082103 

AK082103.1 GI:26349538
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
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RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail: genome-res@gsc.riken.go.jp,
URL: http://genome.gsc.riken.go.jp/, Tel: 81-45-503-9222,
Fax: 81-45-503-9216)

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL: http://genome.gsc.riken.go.jp/
URL: http://fantom.gsc.riken.go.jp/.

Location/Qualifiers 

1..2521

/organism="Mus musculus"

/mol_type="mRNA"

/strain="C57BL/6J"

/db_xref="FANTOM_DB:C230007M01"

/db_xref="taxon:10090"

/clone="C230007M01"

/tissue_type="cerebellum"

/clone_lib="RIKEN full-length enriched mouse cDNA library"
/dev_stage="0 day neonate"
213..1541

/note="unnamed protein product; ENDOTHELIN B RECEPTOR

PRECURSOR (SWISSPROT|P48302, evidence: FASTY, 100%ID,

100%length, match=1326)

putative"

/codon_start=1

/protein_id="BAC38409.1"

/db_xref="GI:26349539"

/translation="MQSPASRCGRALVALLLACGFLGVWGEKRGFPPAQATLSLLGTK
EVMTPPTKTSWTRGSNSSLMRSSAPAEVTKGGRGAGVPPRSFPPPCQRNIEISKTFKY
INTIVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIIIDIPINTY
KLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWT
AVEIVLIWVVSVVLAVPEAIGFDMITSDYKGKPLRVCMLNPFQKTAFMQFYKTAKDWW
LFSFYFCLPLAITAVFYTLMTCEMLRKKSGMQIALNDHLKQRREVAKTVFCLVLVFAL
CWLPLHLSRILKLTLYDQSNPHRCELLSFLLVLDYIGINMASLNSCINPIALYLVSKR
FKNCFKSCLCCWCQTFEEKQSLEEKQSCLKFKANDHGYDNFRSSNKYSSS"
707 a 588 c 589 g 637 t 



Query Match 26.4%; Score 1137.6; DB 11; Length 2521;

Best Local Similarity 76.4%; Pred. No. 3.5e-151;

Matches 1507; Conservative 0; Mismatches 439; Indels 26; Gaps 8;



Qy 193 AAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCCA 252

MM M MMMI Ml Ml I I MUM I I I llllll MM II

Db 168 AAACAGCAGAGCGGCTACCAGACTCTCACAGGAGCAAGCTGTAACATGCAATCGCCCGCA 227

Qy 253 AGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTGG 312

I! I IMIIIIIIII 1 1 1 1 llllll I II Mill MM I I II I Ml

Db 228 AGCCGGTGCGGACGCGCCTTGGTGGCGCTGCTGCTGGCCTGTGGCTTCTTGGGGGTATGG 287

Qy 313 GGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTC---CGCTTTTGCAAACCGCAGAG 369

llllll I Mill Mill MM I Mill I I Ml I II MM

Db 288 GGAGAGAAAAGAGGATTCCCACCTGCCCAAGCCACGCTGTCACTTCTCGGGACTAAAGAG 347



Qy 


370 


Db 


348 


Qy 


430 


Db 


408 


Qy 


490 


Db 


467 


Qy 


550 


Db 


525 


Qy 


610 


Db 


585 


Qy 


670 


Db 


645 


Qy 


730 


Db 


705 


Qy 


790 


Db 


765 


Qy 


850 


Db 


825 


Qy 


910 


Db 


885 


Qy 


970 


Db 


945 


Qy 


1030 


Db 


1005 


Qy 


1090 


Db 


1065 


Qy 


1150 


Db 


1125 



IIIIIMMIIII III llllllll Ml Ml lllllllll llllllll Ml 



TCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACXjCA.CC 489 

II I MMMMMIMM I lllllll Ml III III MUM I I 

TCCTCCGCACCTGCGGAGGTGACCAAAGGAGGGAGGGGGGCTGGAGTCCCGCCAAGATC- 466 

ATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAAC 549 

II llllli lllllllll II II lllllll lllllll I I III 

- - CTTCCCTCCTCCGTGCCAACGAAATATTGAGATCAGCAAGACTTTTAAATACATCAAC 524 

ACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGA 609 

III lllllll Mill MINI || Mlllllllllllllll II II III 

ACGATTGTGTCGTGCCTCGTGTTCGTGCTAGGCATCATCGGGAACTCCACGCTGCTAAGA 584 

ATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCT 669 

II I ! II llllli: || I I II I I I I I I I I II I I I I I II I I Mill 

ATCATCTACAAGAACAAGTGCATGCGCAATGGTCCCAATATCTTGATCGCCAGTCTGGCT 644 

CTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCA 72 9 

lllllllllll lllllllll MM Mill II II II lllllll MM Ml 

CTGGGAGACCTACTGCACATCATCATAGACATACCCATTAACACCTACAAGTTGCTCGCA 704 
GAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 78 9 

II M II Ml MMM M II I IMMi M lllllllllll I I 

GAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCCTTCATACAGAAGGCTTCT 764 
GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 84 9 

MMIIIMII MUM IMMIM lllllllllllllllllllllllllll 

GTGGGAATCACAGTGCTGAGTCTTTGTGCTCTAAGTATTGACAGATATCGAGCTGTTGCT 824 
TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909 

lllllllll II MMM M II I II IM IMM M Ml M M 

TCTTGGAGTCGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTA 884 
ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969 

MM Ml IM I II M IM M II I Ml II II MMM 

ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCCGAAGCCATAGGTTTTGATATGATTACG 944 
ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 102 9 

II III III II I Mill Mill Mill lllllll Mill 

TCGGACTACAAAGGAAAGCCCCTAAGGGTCTGCATGCTTAATCCCTTTCAGAAAACAGCC 1004 
TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 108 9 

II I MMM MMM M I II 1 1 1 II II 1 1 1 1 1 1 1 1 1 II II 1 1 1 lllllllll 

TTCATGCAGTTTTACAAGACAGCCIAAAGATTGGTGGCTGTTCAGTTTCTACTTCTGCTTG 1064 
CCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAA 114 9 

II I M MM I llllllll || IMMIM MMM I II Mill 

CCGCTAGCCATCACTGCAGTCTTTTATACCCTGATGACCTGCGAAATGCTCAGGAAGAAG 1124 
AGTGG CATG CAGATTGCTTTAAATGATCAC CTAAAG CAGAGACGGGAAGTGGC CAAAACC 12 09 

II II I Ml I II lllllllll IM II M lllllllllll II 

AGCGGTATGCAGATTGCTTTGAATGATCACTTAAAGCAGAGACGAGAAGTGGCCAAGACA 1184 



Qy 1210 GTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATT 1269 

Mill I II II || Mill Mill I MM 

Db 1185 GTCTTCTGCCTGGTCCTCGTGTTTGCTCTCTGTTGGCTTCCCCTTCACCTCAGCCGGATC 1244 

Qy 1270 CTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTG 1329

IIIMIIIIM II III I 1 1 1 1 1 1 1 1 I II 1 1 II I III MINIM II 

Db 1245 CTGAAGCTCACCCTGTATGACCAGAGCAATCCACACAGGTGTGAGCTTCTGAGCTTTTTG 1304

Qy 1330 TTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATT 1389 

Mill MMMM I II, MM MMMM II Mill 

Db 1305 TTGGTTTTGGACTACATTGGTATCAACATGGCTTCTTTGAACTCCTGCATCAATCCAATC 1364 

Qy 1390 GCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGG 1449

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 

Db 1365 GCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGTTTGTGCTGCTGG 1424 

Qy 1450 TGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAA 1509

Mill I Mill Mill IIMIIIIIMIII II. Ill I ill! 

Db 1425 TGCCAAACGTTTGAGGAAAAGCAGTCCTTGGAGGAGAAGCAGTCCTGCCTGAAGTTCAAA 1484

Qy 1510 GCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAG 1569 

II II IIIIIIIIIIIIIIIIMIIMI Mill MMMMMIMI lllllll I 

Db 1485 GCCAACGATCACGGATATGACAACTTCCGGTCCAGCAATAAATACAGCTCGTCTTGAAGG 1544

Qy 1570 AAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAATG 1629

I I I II I I II III I II I II Mill II I 

Db 1545 CAAGAACACTCGCCGAATCTCACTGTCCTCATTGTGGACAGATACCATTAAAACAAAATG 1604

Qy 163 0 AAACATTTGCCAAAACAAAACAAAAAACTATGTATT^ 1689 

MM MMMM Mill MM I I Ml I I II 

Db 1605 AAACCGTTGCCAAATCAAAATGGAAAAAACCATGCTAGCAGAAAGGTGTGCGCGCGTGTG 1664

Qy 1690 AAGTGTAATTATTTTAACACTCACAGCTACATATGACAT TTTATGAG CTGTTTAC 1744 

I III lllllll I I II II I I lllllllll 

Db 1665 AGAGGGATTATTTTTAACTGTTCTGACGCTCAACACCGGATATATTCACGGGCTGTTTAC 1724

Qy 1745 GGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAATTTTT 1804 

MM II Mill Ml MM I I 1 1 1 1 1 1 II I II I 

Db 1725 AACCTAAGAAAGCTGTGGGAAGGAATGAAGCCCTCCTCCGTGGGGAAGCACTTAGATTCT 1784 

Qy 1805 TACAGTTAGCACTTCAACATAGCTCTTAACAACTTCCAGGATATTC^ 1864 

I III lllllllll II lllllllll I I I II llllll llllll I 

Db 1785 T- -AGTCAGCACTTCAGCAGAGCTCTTAAAAGCCCCTAGTGCGTTCACATGCCACTTACG 1842 

Qy 1865 CTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTAAATC 1924 

lllllll I MM III I I II II II I I 

Db 1843 TTTAAAAAAACG AGAACTTCACTGAAGTTCTGTTCAGGAGTTTATTATCCAGT 1895

Qy 1925 AATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATGAAGC 1984

I M MM I III Ml II I I III I II Ml MM 

Db 1896 CCTATGAATCTGGATTCAAGAAAGCATGA CATTGCAAAACAATTCTTAAAACGAAGT 1952 

Qy 1985 TTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATATTATC 2044 

II MM II Mill III II MM llllll II III MM 

Db 1953 TTCAATTGCTTAATTTGAAACTTAAAAAAAAAAAACTAATAAATTTTTATGCATACTATC 2012 

Qy 2045 --ACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCA-TT 2101



II I I III MINIM II lllllll II I III I I MM I I II 

Db 2 013 ATACCCACTAATCTGATTGTAACTATATGC^^GAAAAGGC^TATGGTTGGTAAACTT 2 072 

Qy 2102 TTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCA 2153 

III II II I I Ml I I f 1 I I I I I I I I Mill II 
Db 2073 TTTTGGTCATTACCAACATTGAAATGATCAGAATTCGGGGGAAGAAAAGACA 2124
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Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL: http://genome.gsc.riken.go.jp/
URL: http://fantom.gsc.riken.go.jp/.

Location/Qualifiers 

1..3611

/organism="Mus musculus"

/mol_type="mRNA"

/strain="C57BL/6J"

/db_xref="FANTOM_DB:D430047G06"

/db_xref="taxon:10090"

/clone="D430047G06"
/tissue_type="lung"

/clone_lib="RIKEN full-length enriched mouse cDNA library"
/dev_stage="13 days embryo"
CDS 217..1155

/note="unnamed protein product; ENDOTHELIN B RECEPTOR

PRECURSOR (SWISSPROT|P48302, evidence: FASTY, 100%ID,

100%length, match=1326)

putative"

/codon_start=1

/protein_id="BAC39379.1"

/db_xref="GI:26351485"

/translation="MQSPASRCGRALVALLLACGFLGVWGEKRGFPPAQATLSLLGTK
EVMTPPTKTSWTRGSNSSLMRSSAPAEVTKGGRGAGVPPRSFPPPCQRNIEISKTFKY
INTIVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIIIDIPINTY
KLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWT
AVEIVLIWVVSVVLAVPEAIGFDMITSDYKGKPLRVCMLNPFQKTAFMQFYKTAKDWW
LFSFYFWLAASHHCSLLYPDDLRNAQEEERYADCFE"

BASE COUNT 1014 a 821 c 776 g 996 t 4 others 

ORIGIN 

Query Match 26.2%; Score 1126.4; DB 11; Length 3611; 

Best Local Similarity 64.4%; Pred. No. 1.2e-149; 

Matches 2333; Conservative 0; Mismatches 952; Indels 336; Gaps 30; 
Qy 193 AAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCCA 252

1 1 1 1 II lllllll III Ml I I MINI I I I MINI 1 1 1 1 II 

Db 172 AAACAGCAGAGCGGCTACCAGACTCTCACAGGAGCAAGCTGT 231 

Qy 253 AGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTGG 312 

II I lllllll 1 1 1 1 III I II Mill Ml I I I II I III 

Db 232 AGCCGGTGCGGACGCGCCTTGGTGGCGCTGCTGCTGGCCTGTGGCTTCTTGGGGGTATGG 291
Qy 313 GGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTC CGCTTTTGCAAACCGCAGAG 369 

MINI I Mill Mill MM I Mill I I III I II MM 

Db 292 GGAGAGAAAAGAGGATTCCCACCTGCCCAAGCCACGCTGTCACTTCTCGGGACTAAAGAG 351

Qy 370 ATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGG 429

l : IM I III I II I Ml III II II I ill ill Ml 

Db 352 GTAATGACGCCACCCACTAAGACCTCCTGGACCAGAGGTTCCAACTCCAGTCTGATGCGT 411 

Qy 430 TCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACC 489 

II I MMMMMIMM I lllllll III III III MUM I I 

Db 412 TCCTCCGCACCTGCGGAGGTGACCAAAGGAGGGAGGGGGGCTGGAGTCCCGCCAAGATC- 470 

Qy 490 ATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAAC 549

II MUM II Ml II II lllllll lllllll MIMMIMM 

Db 471 --CTTCCCTCCTCCGTGCCAACGAAATATTGAGATCAGCAAGACTTTTAAATACATCAAC 528 

Qy 550 ACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGA 609 

III lllllll Mill III I I II II INI III M || M Ml 

Db 529 ACGATTGTGTCGTGCCTCGTGTTCGTGCTAGGTATCATCGGGAACTCCACGCTGCTAAGA 588

Qy 610 ATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCT 669 

M 1 1 II 1 1 1 i 1 1 J 1 1 1 1 i 1 1 1 1 i 1 1 II 11-11 I II I M Mill 

Db 589 ATCATCTACAAGAACAAGTGCATGCGCAATGGTCCCAATATCTTGATCGCCAGTCTGGCT 648



Qy 


670 


Db 


649 


Qy 


730 


Db 


709 


Qy 


790 


Db 


769 


Qy 


850 


Db 


829 


Qy 


910 


Db 


889 


Qy 


970 


Db 


949 


Qy 


1030 


Db 


1009 


Qy 


1089 


Db 


1069 


Qy 


1149 


Db 


1129 


Qy 


1209 


Db 


1189 


Qy 


1269 


Db 


1249 


Qy 


1328 


Db 


1309 


Qy 


1388 


Db 


1369 


Qy 


1448 


Db 


1429 


Qy 


1508 



CTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCA 729 

Illllllllll lllllllll 1 1 1 1 Mill II II II lllllll MM III 

CTGGGAGACCTACTG CACATCATCATAGACATACCCATTAACACCTACAAGTTG CTCG CA 7 08 

GAGGACTGGC CATTTGGAG CTGAGATGTGTAAG CTGGTGCCTTTCATACAGAAAG CCTCC 789 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 II 1 1 1 1 II I II M I IMIMIMM II II 

GAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCCTTCATACAGAAGGCTTCT 768 

GTGGGAATCACTGTG CTGAGTCTATGTG CTCTGAGTATTGACAGATATCGAGCTGTTG CT 84 9 

I II III MM I IMIMIMM IIIIIIII Illllllllllllllllllllll MM 

GTGGGAATCACAGTGCTGAGTCTTTGTGCTCTAAGTATTGACAGATATCGAGCTGTTGCT 828 

TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909 

lllllllll II II II I ill I II II li: I! I MM II 

TCTTGGAGTCGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTA 888 

ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969 

I ,1 II Mill II I III IIIIIIIIIIIIIIIIIIII I Ml 

ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCCGAAGCCATAGGTTTTGATATGATTACG 948 

ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 102 9 

I I I I I I I I I I I I I I II I Mill Mill Mill lllllll Mill 

TCGGACTAOU^AGGAAAGCCCCTAAGGGTCTGCATGCTTAATCCCTTTCAGAAAACAGCC 1008 

TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCT - GCTT 1088 

MMMMMMMIIMIMM MMMMMMMMMMMIMI MM MM 

TT(^TGC^GTTTTACAAGACAGCCAAAGATTGGTGGCTGTTCAGTTTCTACTTCTGGCTT 1068 

GCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAA 114 8 

Ml I MMIMIMM I IIIIIIII II IMIIMI MUM I II Mill 

GCCGCTAGCCATCACTGCAGTCTTTTATACCCTGATGACCTGCGAAATGCTCAGGAAGAA 1128 

AAGTGG CATG CAGATTG CTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAAC 12 08 

II II llllllllllllll lllllllll lllllllllllll Illllllllll II 

GAG CGGTATG CAGATTG CTTTGAATGATCACTTAAAGCAGAGACGAGAAGTGGCCAAGAC 118 8 

CGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGAT 1268 

Mill IMIMMIM II Mill Mill MMMMMMMMIMM MM 

AGTCTTCTGCCTGGTCCTCGTGTTTGCTCTCTGTTGGCTTCCCCTTCACCTCAGCCGGAT 124 8 

TCTGAAG CTCA - CTCTTTATAATCAGAATGATC CCAATAGATGTGAACTTTTGAG CTTTC 1327 

llllllllll I II III I 1 1 1 1 1 1 1 1 I II Mill III IIIIMII 

CCTGAAG CTCAG C CCTGTATGACCAGAG CAATCCACACAGGTGTGAG CTTCTGAG CTTTT 13 08 

TGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAA 1387 

MMIII IIIIMII 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 MM IIIIIIII II MM 

TGTTGGTTTTGGACTAC^TTGGTATC^C^TGGCTTCTTTGAACTCCTGCATCAATCC^ 1368 

TTGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCT 1447 

I M II MIMM IMM Ml I II M II II ! || MMIII 

TCGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGTTTGTGCTGCT 1428 

GGTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCA 1507 

MMIII I Mill Mill i 1 1 1 1 i 1 r ( 1 1 1 1 1 MMIII! Ml I MMIII 

GGTG CCAAACGTTTGAGGAAAAG CAGTC CTTGGAGGAGAAGCAGTCCTG C CTGAAGTTCA 14 88 

1508 AAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAA 1567 



1 1 1 1 II IIIIIIIIIIIIIIMIIIIIII Mill llllllllllllll MMIII 

Db 148 9 AAGCCAACGATCACGGATATGACAA 1548 

Qy 1568 AGAAGAACTATTCACTGTATTTCATTTTCTTTATATTGGACCGAAGTCATTAAAACAAAA 1627 

II I I II II II III I II I II Mill II MIMMMIMI 

Db 154 9 GGCAAGAACACTCGCCGAATCTCACTGTCCTCATTGTGGACAGATACCATTAAAAC^AAA 1608 

Qy 1628 TGAAACATTTGCCAAAACAAAACAAAAAACTATGTATTTC 1687 

MINI MINIM Mill MM I I Ml I I I 

Db 1609 TGAAACCGTTGCCAAATCAAAATGGAAAAAACCATGCTAGCAGAAAGGTGTGCGCGCGTG 1668 
Qy 1688 TTAAGTGTAATTATTTTAACACTCACAGCTACATATGAC ATTTTATGAG CTGTTT 1742

II III MMIII I I II II I I MMIII 

Db 1669 TGAGAGGGATTATTTTTAACTGTTCTGACGCTCAACACCGGATATATTCACGGGCTGTTT 1728

Qy 1743 ACGGCATGGAAAGAAAATCAGTGGGAATTAAGAAAGCCTCGTCGTGAAAGCACTTAATTT 1802 

II I I II II Mill Ml MM I I lllllllll II 

Db 1729 ACAACCTAAGAAAGCTGTGGGAAGGAATGAAGCCCTCCTCCGTGGGGAAGCACTTAGATT 1788

Qy 1803 TTTACAGTTAGCACTTCAACATAGCTCTTAACAACT^ 1862 

II Ml IMMIIII M lllllllll II Ml MUM MUM 

Db 1789 CTT--AGTCAGCACTTCAGCAGAGCTCTTAAAAGCCCCTAGTGCGTTCACATGCCACTTA 1846

Qy 1863 GGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTAAA 1922 

I MMIII I MM III I I II II M I I 

Db 1847 CGTTTAAAAA AACGAGAACTTCACTGAAGTTCTGTTCAGGAGTTTATTATCCA 1899 

Qy 1923 TCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATGAA 1982 

I II MM I III III II I II II MUM III II I 

Db 1900 GTCCTATGAATCTGGATTCAAGAAAGCAT - - GACATTGCAAAACAATTCTTAAAACGAAG 1957 

Qy 1983 GCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACAACTTTTCAATTAATATTA 2042

I II I II II III II MM I I I III II III II 

Db 1958 TTTCAATTGCTTAATTTGAAACTTAAAAAAAAAAAAACTAATAAATTTTTATGCATACTA 2017

Qy 2043 TC--ACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCA- 2099

II Ml I III IIIIIMI M IIIMM II I Mill MM I I 

Db 2018 TC^TACC(^CTAATCTGATTGTAACTATATGGW^AGAAAAGGCAATATGGTTGGTAAAC 2 077 

Qy 2100 TTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTG 2159

Mill II II I MUM IMIIIMM I I Mill II 

Db 2078 TTTTTTGGTC^TTACC^C^TTGAAATGATC^GAATTCGGGGGAAGAAAAGACAG 2132 

Qy 2160 TTTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCAGCATTC 2219 

Db 2133 2132 

Qy 2220 CCAACATGTCACAAACAAGCAGCATGTAACAGACTGGCACATGTGCCAGCTGAATTTAAA 2279

II I I II II I MM I II M 

Db 2133 CCTGCGAATGCCACAGAGAAAACATGGGAAAGCGTG 2168 

Qy 2280 ATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTC 2339

III III II MM II III 

Db 2169 AGCTGCTATGCCTGAGACTTCTGAAATTCCCTCACACATACTCTGC 2214 

Qy 2340 ACAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTC 2399

I Mill III I II I I I II II III II I I 



Db 


2215 


AGAAAGACACAAA ACAGAACACTACCTATGATTTCTTTAAAGTTCTTTCAA 


2265 


Qy 


2400 


ACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTT 

llll i i ii ii iiiiiiiii iiii 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 llll 
ATATCCTTTCATGATTGAAGTTTAAATTCCATGTGTTCAACTTCATCA - 


2459 


Db 


2266 


2313 


Qy 


2460 


TTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATAC 


2519 


Db 


2314 


1 Ml 1 II III III 1 1 
TCTGTAAATACTTAGCTATTAGCTATAAGC 


2343 


Qy 


2520 


ACTGCATGTAGATGATTAAATGA - - GGG CAGGC C CTGTGCTCATAG CTTTACGATGGAGA 

III II lllll ii i ii i ii t iii i i ii iiiiiii ii ■!■ 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 
ACTACACGTAGAGGACTTAACAAAGGGCAGGTCCCAGCGTTCGTAGCTTTCTGACAAAGA 


2577 


Db 


2344 


2403 


Qy 


2578 


GATGCCAGTGACCTCATAAT - - AAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACA 

IIIIIIIII III 1 II I ill iiiii ■ iiii iiiiiiiiiiitiiiii ii 

llllllllllll III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 I I I I || I I I I I I | 
GATGCCAGTAACCCGGTTATAGACAGAATGTGAATTGCCCGGTGCAGTGTCCACATGGCA 


2635 


Db 


2404 


2463 


Qy 
Db 


2636 
2464 


AAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATGT 

III liiit iiii i ii iii iiiiiiiii i iiitiiii iii 

III llllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllll III 
AAGAAGCAGGGAGCATC- - CTTTCAGCCATGCTGTAGAGAAAATGGTCCACAGC 


2695 
2515 


Qy 


2696 


ATAATG CTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGT - ACATTTAACAG C 

1 III MM lllllll 1 II II llll 1 1 MM II 1 1 llll 

ACAATATGATAG CGAAAATACCGTGGTTTAACG CCATAGAAAATAGTCACTGTAACCAGC 


2754 


Db 


2516 


2575 


Qy 


2755 


TAC CTGTAAAG CTTATTACTAA - TTTTTGTATTATTTTTGTAAATAG CCAATAGAAAAGT 
i ii i ii ii iii ii iiiii i iiiii ii i i ■ i i i i i i i i ■ i i i i i 

III III II III II Mill 1 IIIII II MINI IMIIIIM 1 

TCTCTCGGAGGCATACTACCAACTTTTTATGTTATTCCTGAAAATAGCCAATAGAAAGGC 


2813 


Db 


2576 


2635 


Qy 


2814 


TTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGA 

1 II MMMMIIMM III II II 1 1 1 1 llll 
GTTCTGGACATGGTGCTTTTTCTAAAACGTAGAAGCCAAACTGCTTCGGGGTCTGCAAGA 


2873 


Db 


2636 


2695 


Qy 


2874 


ACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGG 
iiiii i i i i iiiiiiiiii ii iiii 

llll III II IIIII II II 1 II 1 1 II 

TCCTCCTCTTTGCGCATTCTTGTCTAGGTTTTTTTTTTTTTTTTTTTAATCTCCTTTCCA 


2933 


Db 


2696 


2755 


Qy 


2934 


ATAGCTTGGGATGAGATGTGTGTGAAAGTATGTA 

iiii iii i iii i 

llll III 1 llll 
CNAACGTGCCCTTAGGGTTCAACTCCGGAATTGAAGCCGGTGGTGTNNGAAANGAAATGC 


2967 


Db 


2756 


2815 


Qy 


2968 


- - CAAGAGAAAACGGAAGAGAGAGGAAATGAGGTGGGGTTGGAGG AAA 

llllllllll iiiiiiii i iiiiiiiiii i 

1 1 II II 1 1 II llllllll 1 llllllllll I 
CCCAAGAGAAAAACTTAAAGAGAGAAGGAAATTTGAGGTTGGGGGCCCAGAAAGAAAAGC 


3013 


Db 


2816 


2875 


Qy 


3014 


CCCATGGGG ACAGATTCCCATTCTTAGCCTAACGTTCGTCATTGCCTCGTCACA 

III Iiiii i i iiiiiiiiiiiiiiii iiiiiiii iiii iiiii 

Ml IIIII 1 1 II llllll IIIM llllllll MM IIIII 

CCCTTGGGGGAAAATAAATATTCCCATTCTTAGCCCTGTGTTCGTCACTGCCACGTCATG 


3067 


Db 


2876 


2935 


Qy 


3068 


TCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTC 

II II llllllllll II 1 MMMIIMM II III IMIIIII IIIII 

TCGGTGTGAAAGGTCCTGGTTCGGCTCCAGCAAAACAAAGCGCAGCGTTCTCAGCGTGAC 


3127 


Db 


2936 


2995 


Qy 
Db 


3128 
2996 


TTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAATATGCCCAAATT 

1 1 II III MM MIIMIMM MMMIIMM 1 1 II 

- TCGGGAACAAACCAAG C C CGAGAGCTTTAACCTTGTCTTAAAATATAACAGATTTTC CT 


3183 
3054 



Qy 


3184 




3186 


Db 


3055 


1 1 

TCCTTCCTTTTTCTCTTTCTTCTCTTCTCTTCTCTTCTCTTCTCTTCTCTTCTCTTCTCT 


3114 


Qy 


3187 


ACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAATGTTGTTT 

1! 1 1 Ml Ml 1 MM II 1 INI INI 1 II 

TCTCTTCTCTTCTTTTCATAACCCAGGCCACATGTTGAAAATGAGCTTAACAATGCAGTT 


3246 


Db 


3115 


3174 


Qy 


3247 


TCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAACAATGTGGCCAGAAAGA 

1 1 1 1 1 1 Mill MM MUM MM MUM II 1 III 

TTCTACCAAAATC^TTGTGAC^TACAATAAACCCAAACGGGAC^ATGAGGTAAAAAACC 


3306 


Db 


3175 


3234 


Qy 


3307 


AAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTATAAATCACCCACAAAC 

MM Mill 1 1 1 MM II III MM 1 1 MM 

AAGAACAATACTGAATCCACGTGACAC ATGACTCTCTTTAGGAGTCACCCACAGTT 


3366 


Db 


3235 


3290 


Qy 


3367 


TTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAAGTCATTTT 


3426 


Db 


3291 


i i i ii i i i i mill i i i 

CTTGTGTGTA CAGATTGCTTTTTAATCATAAAGGACGCCCC 


3331 


Qy 


3427 


AGACTCTCAATTTTAAATTAATT - TTGAATCACTAATATTTTCACAGTTTATTAATATAT 

III MM II Ml || Ml II I I || Mill I MM 1 

AGATCTTCAATTTTAAGTTAGTTATTGGCTCCCCAGTAGTTTCACAGCGTGGATATATTT 


3485 


Db 


3332 


3391 


Qy 


3486 


TTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTTACATCCTG 

III II 1 II MIIIIMI MM Ml || MM Ml MM 

TTAATTTTTACT-AAGTTTTAGATTGGTTTTATTGTTGTGTTCTAAATTCTTAAGTCCTA 


3545 


Db 


3392 


3450 


Qy 

Db 


3546 
3451 


ATACCCTTTCCT TCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAA 

1 1 1 M 1 1 1 1 1 1 Mill 1 MM II II II 1 1 1 

ACATCTTTGTTTAACCCAGATGTTCCTTCCCTCTTCATGGGCAATAATCGTCCTGCCAAA 


3597 
3510 


Qy 


3598 


TTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGG 

M Mill II 1 II 1 II 1 1 Ml II Mill | Ml Mill 

TTATGAAATGGCATAAGAATACTATTCACATAATATATACAATAAAACTATATTAAGTGG 


3657 


Db 


3511 


3570 


Qy 


3658 


CTTTTTAAAAAAAATGTTTGA 3678 




Db 


3571 


llllll 1 III III 1 

CTTTTTTTATTAAAAATTTTA 3591 





RESULT 5 
AK076426 
LOCUS       AK076426 2669 bp mRNA linear HTC 05-DEC-2002 
DEFINITION  Mus musculus 0 day neonate head cDNA, RIKEN full-length enriched 
            library, clone:4832401B07 product:ENDOTHELIN B RECEPTOR PRECURSOR, 
            full insert sequence. 
ACCESSION   AK076426 
VERSION     AK076426.1 GI:26345371 
KEYWORDS    HTC; CAP trapper. 
SOURCE      Mus musculus (house mouse) 
  ORGANISM  Mus musculus 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
REFERENCE   1 
  AUTHORS   Carninci,P. and Hayashizaki,Y. 
  TITLE     High-efficiency full-length cDNA cloning 
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999) 
  MEDLINE   99279253 
   PUBMED   10349636 
REFERENCE   2 
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y. 
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to 
            prepare full-length cDNA libraries for rapid discovery of new genes 
  JOURNAL   Genome Res. 10 (10), 1617-1630 (2000) 
  MEDLINE   20499374 
   PUBMED   11042159 
REFERENCE   3 
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P., 
            Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M., 
            Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A., 
            Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K., 
            Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., 
            Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., 
            Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y. 
  TITLE     RIKEN integrated sequence analysis (RISA) system--384-format 
            sequencing pipeline with 384 multicapillary sequencer 
  JOURNAL   Genome Res. 10 (11), 1757-1771 (2000) 
  MEDLINE   20530913 
   PUBMED   11076861 
REFERENCE   4 
  AUTHORS   Kawai,J., Shinagawa,A., Shibata,K., Yoshino,M., Itoh,M., Ishii,Y., 
            Arakawa,T., Hara,A., Fukunishi,Y., Konno,H., Adachi,J., Fukuda,S., 
            Aizawa,K., Izawa,M., Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka,I., 
            Saito,T., Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito,R., 
            Kadota,K., Matsuda,H., Ashburner,M., Batalov,S., Casavant,T., 
            Fleischmann,W., Gaasterland,T., Gissi,C., King,B., Kochiwa,H., 
            Kuehl,P., Lewis,S., Matsuo,Y., Nikaido,I., Pesole,G., 
            Quackenbush,J., Schriml,L.M., Staubli,F., Suzuki,R., Tomita,M., 
            Wagner,L., Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H., 
            Baldarelli,R., Barsh,G., Blake,J., Boffelli,D., Bojunga,N., 
            Carninci,P., de Bonaldo,M.F., Brownstein,M.J., Bult,C., 
            Fletcher,C., Fujita,M., Gariboldi,M., Gustincich,S., Hill,D., 
            Hofmann,M., Hume,D.A., Kamiya,M., Lee,N.H., Lyons,P., 
            Marchionni,L., Mashima,J., Mazzarelli,J., Mombaerts,P., Nordone,P., 
            Ring,B., Ringwald,M., Rodriguez,I., Sakamoto,N., Sasaki,H., 
            Sato,K., Schonbach,C., Seya,T., Shibata,Y., Storch,K.F., Suzuki,H., 
            Toyo-oka,K., Wang,K.H., Weitz,C., Whittaker,C., Wilming,L., 
            Wynshaw-Boris,A., Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
            and Hayashizaki,Y. 
  TITLE     Functional annotation of a full-length mouse cDNA collection 
  JOURNAL   Nature 409 (6821), 685-690 (2001) 
  MEDLINE   21085660 
   PUBMED   11217851 
REFERENCE   5 
  AUTHORS   The FANTOM Consortium and the RIKEN Genome Exploration Research 
            Group Phase I & II Team. 
  TITLE     Analysis of the mouse transcriptome based on functional annotation 
            of 60,770 full-length cDNAs 
  JOURNAL   Nature 420, 563-573 (2002) 
REFERENCE   6 (bases 1 to 2669) 
  AUTHORS   Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P., 
            Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume,W., 
            Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
            Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T., 
            Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M., 
            Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Murata,M., 
            Nakamura,M., Nishi,K., Nomura,K., Numazaki,R., Ohno,M., Ohsato,N., 
            Okazaki,Y., Saito,R., Saitoh,H., Sakai,C., Sakai,K., Sakazume,N., 
            Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T., 
            Sogabe,Y., Tagami,M., Tagawa,A., Takahashi,F., Takaku-Akahira,S., 
            Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A., 
            Muramatsu,M. and Hayashizaki,Y. 
  TITLE     Direct Submission 
  JOURNAL   Submitted (16-APR-2002) Yoshihide Hayashizaki, The Institute of 
            Physical and Chemical Research (RIKEN), Laboratory for Genome 
            Exploration Research Group, RIKEN Genomic Sciences Center (GSC), 
            RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
            Kanagawa 230-0045, Japan (E-mail:genome-res@gsc.riken.go.jp, 
            URL:http://genome.gsc.riken.go.jp/, Tel:81-45-503-9222, 
            Fax:81-45-503-9216) 
COMMENT     cDNA library was prepared and sequenced in Mouse Genome 
            Encyclopedia Project of Genome Exploration Research Group in Riken 
            Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
            Division of Experimental Animal Research in Riken contributed to 
            prepare mouse tissues. 
            Please visit our web site for further details. 
            URL:http://genome.gsc.riken.go.jp/ 
            URL:http://fantom.gsc.riken.go.jp/. 
FEATURES             Location/Qualifiers 
     source          1..2669 
                     /organism="Mus musculus" 
                     /mol_type="mRNA" 
                     /strain="C57BL/6J" 
                     /db_xref="FANTOM_DB:4832401B07" 
                     /db_xref="taxon:10090" 
                     /clone="4832401B07" 
                     /tissue_type="head" 
                     /clone_lib="RIKEN full-length enriched mouse cDNA library" 
                     /dev_stage="0 day neonate" 
     CDS             220..1548 
                     /note="unnamed protein product; ENDOTHELIN B RECEPTOR 
                     PRECURSOR (SWISSPROT|P48302, evidence: FASTY, 100%ID, 
                     100%length, match=1326) 
                     putative" 
                     /codon_start=1 
                     /protein_id="BAC36337.1" 
                     /db_xref="GI:26345372" 
                     /translation="MQSPASRCGRALVALLLACGFLGVWGEKRGFPPAQATLSLLGTK 
                     EVMTPPTKTSWTRGSNSSLMRSSAPAEVTKGGRGAGVPPRSFPPPCQRNIEISKTFKY 
                     INTIVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIIIDIPINTY 
                     KLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWT 
                     AVEIVLIWVVSVVLAVPEAIGFDMITSDYKGKPLRVCMLNPFQKTAFMQFYKTAKDWW 
                     LFSFYFCLPLAITAVFYTLMTCEMLRKKSGMQIALNDHLKQRREVAKTVFCLVLVFAL 
                     CWLPLHLSRILKLTLYDQSNPHRCELLSFLLVLDYIGINMASLNSCINPIALYLVSKR 
                     FKNCFKSCLCCWCQTFEEKQSLEEKQSCLKFKANDHGYDNFRSSNKYSSS" 
BASE COUNT  714 a 613 c 593 g 749 t 
ORIGIN 



Query Match 23.7%; Score 1020; DB 11; Length 2669; 

Best Local Similarity 84.7%; Pred. No. 1.4e-134; 

Matches 1169; Conservative 0; Mismatches 205; Indels 6; Gaps 2; 
Qy 193 AAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCATGCAGCCGCCTCCA 252 

1 1 1 1 II lllllll III III I I MINI I I I MINI 1 1 1 1 1 1 

Db 175 AAACAGCAGAGCGGCTACCAGACTCTCACAGGAGCAAGCTGTAACATGCAATCGCCCGCA 234 

Qy 253 AGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTGG 312 

II I 1 ! 1 1 1 1 1 I II Mill III I I I II I III 

Db 235 AGCCGGTGCGGACGCGCCTTGGTGGCGCTGCTGCTGGCCTGTGGCTTCTTGGGGGTATGG 294 

Qy 313 GGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTC - - - CGCTTTTGCAAACCGCAGAG 369 

1 1 1 1 1 1 I Mill Mill 1 1 1 1 I Mill I Mill II MM 

Db 295 GGAGAGAAAAGAGGATTCCCACCTGCCCAAGCCACGCTGTCACTTCTCGGGACTAAAGAG 354 

Qy 370 ATAATGACGCCACCCACTAAGACCTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGG 429 

Illlllllllllllllllllllll III III III llllllll 1 1 1 

Db 355 GTAATGACGCCACCCACTAAGACCTCCTGGACCAGAGGTTCCAACTCCAGTCTGATGCGT 414 

Qy 430 TCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACC 489 

II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I lllllll III Ml III MIMI I I 

Db 415 TCCTCCGCACCTGCGGAGGTGACCAAAGGAGGGAGGGGGGCTGGAGTCCCGCCAAGATC- 473 

Qy 490 ATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAAC 549 

II MIMI Ml Mill II II MMIM IIIMM MINIUM 

Db 474 --CTTCCCTCCTCCGTGCCAACGAAATATTGAGATCAGCAAGACTTTTAAATACATCAAC 531 

Qy 550 ACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGA 609 

III lllllll Mill lllllllllll II MINIMI II M Ml 

Db 532 ACGATTGTGTCGTGCCTCGTGTTCGTGCTAGGCATCATCGGGAACTCCACGCTGCTAAGA 591 

Qy 610 ATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCT 669 

II IIIIIIIIIIIIIIIIIIIIIII II IIIIIIIIIIIIIIIIIIIIIII Mill 

Db 592 ATCATCTACAAGAACAAGTGCATGCGCAATGGTCCCAATATCTTGATCGCCAGTCTGGCT 651 

Qy 670 CTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCA 729 

lllllllllll lllllllll MM Mill II II II lllllll MM III 

Db 652 CTGGGAGACCTACTGCACATCATCATAGACATACCCATTAACACCTACAAGTTGCTCGCA 711 

Qy 730 GAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCC 789 

I IMM I Ml M II M I I! II M IIIMM || || 

Db 712 GAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCCTTCATACAGAAGGCTTCT 771 

Qy 790 GTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCT 849 

1 1 1 1 1 1 1 1 1 II MMIM MMMM I I | | | | M II I I I II II I I I I M I I I I 

Db 772 GTGGGAATCACAGTGCTGAGTCTTTGTGCTCTAAGTATTGACAGATATCGAGCTGTTGCT 831 

Qy 850 TCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTG 909 

lllllllll I M IMM I I IMM M M II IMM I II 

Db 832 TCTTGGAGTCGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTA 891 

Qy 910 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACG 969 

i r j 1 1 1 1 1 1 f 1 1 1 r 1 1 r 1 1 1 1 1 r 1 1 1 1 1 r r 1 1 iiiimmmiimmimi mimi 

Db 892 ATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCCGAAGCCATAGGTTTTGATATGATTACG 951 



Qy 



970 ATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCT 1029 



Db 


952 


Qy 


1030 


Db 


1012 


Qy 


1090 


Db 


1072 


Qy 


1150 


Db 


1132 


Qy 


1210 


Db 


1192 


Qy 


1270 


Db 


1252 


Qy 


1330 


Db 


1312 


Qy 


1390 


Db 


1372 


Qy 


1450 


Db 


1432 


Qy 


1510 


Db 


1492 



Mllllllllllll II I Mill Mill Mill I Ml Mill 

TCGGACTACAAAGGAAAGCCCCTAAGGGTCTGCATGCTTAATCCCTTTCAGAAAACAGCC 1011 
TTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTG 1089 

I M II M IIMIMM immmmmimmimmimm immimi 

TTCATGCAGTTTTACAAGACAGCCAAAGATTGGTGGCTGTTCAGTTTCTACTTCTGCTTG 1071 



II I III III MM II I II III III M MUM II MUM I II Mill 



AGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACC 1209 

II II MIMMMIMM Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMIMIM II 

AG CGGTATG CAGATTGCTTTGAATGATCACTTAAAG CAGAGACGAGAAGTGG C CAAGACA 1191 
GTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTQ\GCAGGATT 1269 

Mill IIIIIIIIMI II Mill Mill MMIMMIMMMIMM MM 

GTCTTCTGCCTGGTCCTCGTGTTTGCTCTCTGTTGGCTTCCCCTTCACCTCAGCCGGATC 1251 



IMMIMIM II III I MM MM I II Mill III IIIIMM II 



TTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATT 138 9 

MMI 1 1 1 1 1 1 1 1 MMMMMIMMIMM MM 1 1 1 1 1 1 1 1 II Mill 

TTGGTTTTGGACTACATTGGTATCAACATGGCTTCTTTGAACTCCTGCATCAATCCAATC 1371 
GCTCTGTATTTGGTGAG CAAAAGATTCAAAAACTGCTTTAAGTCATG CTTATG CTG CTGG 144 9 

MM IMM III I II I Ml M> II II MMM M IMMIMI 

GCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGTTTGTGCTGCTGG 1431 



MMI I Mill MMI IIIIIIIIIIIMI MIMIII Ml I MIIIMM 

TGCCAAACGTTTGAGGAAAAGCAGTCCTTGGAGGAGAAGCAGTCCTGCCTGAAGTTCAAA 
GCTAATGATCACGGATATGACAACTTCCGTTCCAGTAATAAATACAGCTCATCTTGAAAG 

II M I T II I IMU II Mill MMIMMIMM Ml I I 



RESULT 6 
AL571798/C 
LOCUS       AL571798 1201 bp mRNA linear EST 31-MAY-2003 
DEFINITION  AL571798 Homo sapiens PLACENTA COT 25-NORMALIZED Homo sapiens cDNA 
            clone CS0DI030YM19 3-PRIME, mRNA sequence. 
ACCESSION   AL571798 
VERSION     AL571798.2 GI:31293189 
KEYWORDS    EST. 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1 (bases 1 to 1201) 
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D. 
  TITLE     Full-length cDNA libraries and normalization 
  JOURNAL   Unpublished 
COMMENT     On Feb 16, 2001 this sequence version replaced gi:12929453. 
            Contact: Genoscope 
            Genoscope - Centre National de Sequencage 
            BP 191 91006 EVRY cedex - France 
            Email: seqref@genoscope.cns.fr, Web: www.genoscope.cns.fr 
            Library was constructed by Life Technologies, a division of 
            Invitrogen. This sequence belongs to sequence cluster 7006.r For 
            more information about this cluster, see 
            http://www.genoscope.cns.fr/ 
            cgi-bin/cluster.cgi?seq=CS0DI030AG10NP1&cluster=7006.r. Contact: 
            Feng Liang Email: fliang@lifetech.com URL: 
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
            Faraday Avenue Genoscope sequence ID: CS0DI030AG10NP1. 
FEATURES             Location/Qualifiers 
     source          1..1201 
                     /organism="Homo sapiens" 
                     /mol_type="mRNA" 
                     /db_xref="taxon:9606" 
                     /clone="CS0DI030YM19" 
                     /tissue_type="PLACENTA COT 25-NORMALIZED" 
                     /clone_lib="Homo sapiens PLACENTA COT 25-NORMALIZED" 
                     /note="1st strand cDNA was primed with a NotI-oligo(dT) 
                     primer. Five prime end enriched, double-strand cDNA was 
                     digested with Not I and cloned into the Not I and EcoR V 
                     sites of the pCMVSPORT 6 vector. Library was normalized." 
BASE COUNT  331 a 259 c 269 g 299 t 43 others 
ORIGIN 

Query Match 22.9%; Score 987; DB 9; Length 1201; 

Best Local Similarity 96.0%; Pred. No. 7.8e-130; 

Matches 1030; Conservative 10; Mismatches 28; Indels 5; Gaps 3; 

Qy 489 CATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAA 548 

h M M : I I I I I I IMI II I I I II II I 

Db 1068 CM(^TCTCCCYCCCCCGTGCC^GACC(^Ta^GATCAAG--AGAYTTC^y^TA(^T(^ 1011 

Qy 549 CACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAG 608 

Ml I- 1 II II I II' II ll Ihllllllllllllllll 

Db 1010 CAC-GTTKTKTCCTGCCTTGTGTTCGTGCTGGGATCATC--GGRACTCCACACTTCTGAG 954 

Qy 609 AATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGC 668 

MIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIII 

Db 953 AATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGC 894 

Qy 669 TCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGC 728 

M' II! P II II I 'III II III IMI II II IIMII | 

Db 893 TCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGC 834 

Qy 729 AGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTC 788 

IIIMIIIIIIIMMIMIIIMIIIIMIIMIIMIIIIIIIIIIIIIIMIIIIII 

Db 833 AGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTC 774 

Qy 789 CGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGC 848 

MINI MM III MM II Mill IIHIIIMIII III II II II MINIMI II MM 

Db 773 CGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGC 714 

Qy 849 TTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTT 908 

MMMMMMMMIMMMMMMMMIMMMMMIMMIIIIMIMM 



Db 


713 


Qy 


909 


Db 


653 


Qy 


969 


Db 


593 


Qy 


1029 


Db 


533 


Qy 


1089 


Db 


473 


Qy 


1149 


Db 


413 


Qy 


1209 


Db 


353 


Qy 


1269 


Db 


293 


Qy 


1329 


Db 


233 


Qy 


1389 


Db 


173 


Ov 


144 9 


Db 


113 


Qy 


1509 


Db 


53 



713 TTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTT 654 
GATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTAC 968 

I II II I II 'I! Mill II Mill !l li II llilllll MM 

GATTTGGGTGGT CT CTGTGGTTCTGG CTGTC CCTGAAG CCATAGGTTTTGATATAATTAC 594 
GATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGC 1028 

IIIIIIIIIIIIIIMIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 

GATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGC 534 
TTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTT 1088 

IIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMII 

TTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTT 4 74 
G CCATTGGC CATCACTG CATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAA 1148 

MMIMIMM I ! Ml IMM I I I M; Mill II III I 

GCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAA 414 
AAGTGG CATG CAGATTG CTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGG CCAAAAC 1208 

II 1 1 1 II 1 1 M 1 1 M I II Ml II II II I II 1 1 1 II MM II M I M MM I Ml I M 

AAGTGGCATGCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAAC 354 
CGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGAT 1268 

: I ■ , -' ■ M. . MM, Ml M I II I 'M M 

CGTCTTTTGCCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGAT 2 94 
TCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCT 1328 

M II MM: I I ill IMM I I I M II Mill M I I 



MIM I I MM II IMIM I M I II II IMM MM 



TGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTG 1448 

I IMM I IMMI IMM IM M II MMIMIMI IMM MIM 

TGCTCTGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTG 114 
GTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAA 1508 

: I I I . ! ■ : I I ! i I : I , i 

GTGCCAGTCATTTGAAGAAAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAA 5 4 



IIIIMMII I i I : : | Ml 



RESULT 7 
BI520706/C 
LOCUS       BI520706 957 bp mRNA linear EST 29-AUG-2001 
DEFINITION  603071813T1 NIH_MGC_119 Homo sapiens cDNA clone IMAGE:5163746 3', 
            mRNA sequence. 
ACCESSION   BI520706 
VERSION     BI520706.1 GI:15345498 
KEYWORDS    EST. 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1 (bases 1 to 957) 
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/. 
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC) 
  JOURNAL   Unpublished 
COMMENT     Contact: Robert Strausberg, Ph.D. 
            Email: cgapbs-r@mail.nih.gov 
            Tissue Procurement: Life Technologies, Inc. 
            cDNA Library Preparation: Life Technologies, Inc. 
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
            DNA Sequencing by: Incyte Genomics, Inc. 
            Clone distribution: MGC clone distribution information can be 
            found through the I.M.A.G.E. Consortium/LLNL at: 
            http://image.llnl.gov 
            Plate: LLAM11406 row: j column: 03 
            High quality sequence start: 4 
            High quality sequence stop: 954. 
FEATURES             Location/Qualifiers 
     source          1..957 
                     /organism="Homo sapiens" 
                     /mol_type="mRNA" 
                     /db_xref="taxon:9606" 
                     /clone="IMAGE:5163746" 
                     /tissue_type="medulla" 
                     /lab_host="DH10B" 
                     /clone_lib="NIH_MGC_119" 
                     /note="Organ: brain; Vector: pCMV-SPORT6; Site_1: Not I; 
                     Site_2: EcoRV (destroyed); RNA source normal medulla from 
                     anonymous male age 27. Library is oligo-dT primed and 
                     directionally cloned (EcoRV site is destroyed upon 
                     cloning). Average insert size 1.3 kb, insert size range 
                     0.9-3 kb. Library is normalized and enriched for 
                     full-length clones and was constructed by C. Gruber 
                     (Invitrogen). Research Genetics tracking code 013. Note: 
                     this is a NIH_MGC Library." 
BASE COUNT  289 a 225 c 199 g 244 t 
ORIGIN 

Query Match 20.1%; Score 866; DB 12; Length 957; 

Best Local Similarity 96.5%; Pred. No. 1e-112; 

Matches 917; Conservative 0; Mismatches 30; Indels 3; Gaps 3; 



Qy  569 TGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAACAAGT 628 
        |||||||||||||||||||| |||  |||||||||||||||||  ||||||  | |||| 
Db  956 TGTTCGTGCTGGGGATCATCCGGACTTCCACACTTCTGAGAATACTCTACAGAACCAAGG 897 

Qy  629 GCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTGCACA 688 
          ||||||||||||||||||||||      |    | |  ||||||||||||||||||||
Db  896 CAATGCGAAACGGTCCCAATATCTGATCGCCAGCTTGGCTTCTGGGAGACCTGCTGCACA 837 

Qy  689 TCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCATTTGGAG 748 
        |||||||||||||||||||||| |||||||||||| | ||||||||||||||||||||||
Db  836 TCGTCATTGACATCCCTATCAAGGTCTACAAGCTG-TTGCAGAGGACTGGCCATTTGGAG 778 

Qy  749 CTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTGTGCTGA 808 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  777 CTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTGTGCTGA 718 

Qy  809 GTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAATTAAAG 868 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  717 GTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAATTAAAG 658 

Qy  869 GAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCTCTGTGG 928 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  657 GAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCTCTGTGG 598 

Qy  929 TTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGATGGACTACAAAGGAAGTT 988 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  597 TTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGATGGACTACAAAGGAAGTT 538 

Qy  989 ATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGTTTTACAAGA 1048 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  537 ATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGTTTTACAAGA 478 

Qy 1049 CAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCACTGCAT 1108 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  477 CAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCACTGCAT 418 

Qy 1109 TT-TTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGATTGCT 1167 
        || |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  417 TTGTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGATTGCT 358 

Qy 1168 TTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGTCTTTTGCCTGGTCCTT 1227 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  357 TTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGTCTTTTGCCTGGTCCTT 298 

Qy 1228 GTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTAT 1287 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  297 GTCTTTGCCCTCTGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTAT 238 

Qy 1288 AATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATATT 1347 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  237 AATCAGAATGATCCCAATAGATGTGAACTTTTGAGCTTTCTGTTGGTATTGGACTATATT 178 

Qy 1348 GGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGGTGAGC 1407 
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  177 GGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTCTGTATTTGGTGAGC 118 

Qy 1408 AAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCT-GCTGGTGCCAGTCATTTGAAGA 1466 
        ||||||||||| ||||||||||||||||||||||||| ||||||||||||||||||||||
Db  117 AAAAGATTCAACAACTGCTTTAAGTCATGCTTATGCTGGCTGGTGCCAGTCATTTGAAGA 58 

Qy 1467 AAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATG 1516 
        ||||||||||||||||||||||||||||||||||||||| | |||||| |
Db   57 AAAACAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTTACAGCTAAGG 8 





RESULT 8 
AL553041/C 
LOCUS       AL553041 1201 bp mRNA linear EST 31-MAY-2003 
DEFINITION  AL553041 Homo sapiens PLACENTA COT 25-NORMALIZED Homo sapiens cDNA 
            clone CS0DI072YK22 3-PRIME, mRNA sequence. 
ACCESSION   AL553041 
VERSION     AL553041.2 GI:31274855 
KEYWORDS    EST. 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1 (bases 1 to 1201) 
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D. 
  TITLE     Full-length cDNA libraries and normalization 
  JOURNAL   Unpublished 
COMMENT     On Feb 15, 2001 this sequence version replaced gi:12892503. 
            Contact: Genoscope 
            Genoscope - Centre National de Sequencage 
            BP 191 91006 EVRY cedex - France 
            Email: seqref@genoscope.cns.fr, Web: www.genoscope.cns.fr 
            Library was constructed by Life Technologies, a division of 
            Invitrogen. This sequence belongs to sequence cluster 7006.r For 
            more information about this cluster, see 
            http://www.genoscope.cns.fr/ 
            cgi-bin/cluster.cgi?seq=CS0DI072BF11NP1&cluster=7006.r. Contact: 
            Feng Liang Email: fliang@lifetech.com URL: 
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
            Faraday Avenue Genoscope sequence ID: CS0DI072BF11NP1. 
FEATURES             Location/Qualifiers 
     source          1..1201 
                     /organism="Homo sapiens" 
                     /mol_type="mRNA" 
                     /db_xref="taxon:9606" 
                     /clone="CS0DI072YK22" 
                     /tissue_type="PLACENTA COT 25-NORMALIZED" 
                     /clone_lib="Homo sapiens PLACENTA COT 25-NORMALIZED" 
                     /note="1st strand cDNA was primed with a NotI-oligo(dT) 
                     primer. Five prime end enriched, double-strand cDNA was 
                     digested with Not I and cloned into the Not I and EcoR V 
                     sites of the pCMVSPORT 6 vector. Library was normalized." 
BASE COUNT  428 a 102 c 202 g 394 t 75 others 
ORIGIN 

Query Match 20.0%; Score 860.2; DB 9; Length 1201; 

Best Local Similarity 89.9%; Pred. No. 6.2e-112; 

Matches 966; Conservative 32; Mismatches 54; Indels 22; Gaps 7; 



Qy 


3179 


AAATTTTTACTTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAAGCTAGTAA 

MM 1 M MM MM : 1 1 , 1 ,h ::: Mh : 1 

AAATCCMAAATTAMCTTTTTTY YTTWAAWGKCKGKCCACWKTKKRAAAAK - - CTWKA 


3238 


Db 


1061 


1007 


Qy 


3239 


TGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTA^ 

M MUM MM IN || Ml MIMIMMM h 1 

AATTTKTTTCTTCCAATATTAATTTT TTTACATAAACCAAACCCAACAATTTKCC 


3298 


Db 


1006 


952 


Qy 


3299 


CA.GAAAGAAAGAGCAATAATAATTAATTG& 

II M = h IhlMllllllhhIIMIh IhlMIIIIIIMIIII 

ATAAAKTAA - - KKTCAWTATWATTAATTCACMCMCCATATKTATYCTATTTATAAATCAC 


3358 


Db 


951 


894 


Qy 


3359 


CCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTATCATAGAA 

llllhMIHIIIIIIIII ::|IIMIIIIIIIIIIMII|:||||||MIIIIIII 
CC^C^WACTTKTTCTTTAATTCMWTCCCAATCACTTTTTCAGAGKCCTGTTATCATAGAA 


3418 


Db 


893 


834 



Qy 


3419 


Db 


833 


Qy 


3478 


Db 


773 


Qy 


3538 


Db 


713 


Qy 


3598 


Db 


653 


Qy 


3658 


Db 


593 


Qy 


3718 


Db 


534 


Qy 


3778 


Db 


474 


Qy 


3838 


Db 


414 


Qy 


3898 


Db 


354 


Qy 


3958 


Db 


294 


Qy 


4018 


Db 


234 


Qy 


4078 


Db 


174 


Qy 


4137 


Db 


114 


Qy 


4190 


Db 


54 



3419 GTCATTTTAGACTCTCAATTTTAAATT-AATTTTGAATCACTAATATTTTCACAGTTTAT 3477 

1 1 M 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 !l I ■ 1 1 1 1 1 1 1 1 1 III 1 1 

33 GTCATTTTAGACTCTCAATTTTAAATTAAATTTTGAATCACTAATATTTTCACAGTTTAT 774 

TAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTT 3537 

Ml IMII II; II IMI II II II, III II I III :lll I 

TAATATATTTAATTTCTATTTAAATTTTAGATTATTTTTATTACCATGTACTGAATTTTT 714 

ACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAA 3597 

I Ml I II I II II II 1 1 1 1; 1 1 1 MIIIMI Mill Ml M I Ml .1 1 1 1 1 1 Ml Ml 

ACATCCTGATACCCTTTCCTTCTCCATGTCAGTATCATGTTCTCTAATTATCTTGCCAAA 654 
TTTTGAAACTACACACAAAAAGCATACTTGCATTATTTATAATAAAATTGCATTCAGTGG 3657 

II 1 1 : : 1 1 i ' 1 1 .11 ill I MM I 

TTTTGAAACTAC^CAC^UUVAGC^TACTTGCATTATTTATAATAAAATCGCATTC^GTGG 594 
CTTTTTAAAAAAAATGTTTGATTCAAAACTTTAACATACTGATAAGTAAGAAACAATTAT 3717 

iiiiii . . : i i , . ■ ii . i 

CTTTTT - AAAAAAATGTTTGATTCAAAACTTTVACATACTGATAAGTAAGAAACAATTAT 53 5 
AATTTCTTTACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTTCAACTTCAAAACA 3777 

Mil II IMMIMII IIMIIMM II III III I III MIMM 

AATTTCTTNACATACTCAAAACCAAGATAGAAAAAGGTGCTATCGTT(^CTTaUU^ACA 475 
TGTTTCCTAGTATTAAGGACTTTAATATAGCAACAGACAAAATTATTGTTAACATGGATG 3837 

II IMM Ml IMM MM M ^ MMMI Ml III IMMM M 

TGTTTCCTAGTATTAAGGACTTTAATATVGCMACAGACAAAATTATTGTTAACATGGATG 415 
TTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCT 3897 

M I IM IMM IM II MIMI I! II IMM Ml II M 

TTACAGCTCAAAAGATTTATAAAAGATTTTAACCTATTTTCTCCCTTATTATCCACTGCT 355 
AATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAG CTTACATATGG C CAAAGGAAT 3 957 

I1IIIMIIIIMMIMIIIIIIIIIIIIIIIMIMIMIIIIIIIIIIIIIIMIII 

AATGTGGATGTATGTTCAAACACCTTTTAGTATTGATAGCTTACATATGGCCAAAGGAAT 295 
ACAGTTTATAG CAAAACATGGGTATG CTGTAG CTAACTTTATAAAAGTGTAATATAACAA 4 017 

II M Ml III M Ml Ml II I MMMI III III II MM I 

ACAGTTTATAGCAAAACATGGGTATGCTGTAGCTAACTTTATAAAAGTGTAATATAACAA 235 
TGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTGCCTAAAGTGGCTATAGTTACTGA 4 077 

II M III Ml I III Ml II I MMMI M' IM II II I 

TGTAAAAAATTATATATCTGGGAGGATTTTTTGGTTG CCTAAAGTGG CTATAGTTACTGA 175 
TTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTT - AACAACTACCTTATT 4136 

MM III Mil MM II I II I Ml 1 1 MM llllllllllllll III MM MIMM I 

TTTTTTATTATGTAAGCAAAACCAATAAAAATTTAAGTTTTTTTCAACAACTACCTTATT 115 
TTTCACTGTACAGACACTAATTCATTAAATACTAA TTGATTGTTTAAAAGAAA 4189 

II II Ml III II Ml Ml II I I II III 

TTTCACTGTACAGACACTAATTCATTAAATACTCACACTCTCGCACTTGTTTAAAAGAAA 55 
TATAAATGTGACAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAG 4243 

M ; ' I M, II iM Ml I M I M I M'.'M 

TATAAAKGBGMCAAGTGGACATTATTTATGTTAAATATACAATTATCAAGCAAG 1 



RESULT 9 
AL543805 
LOCUS       AL543805 942 bp mRNA linear EST 31-MAY-2003
DEFINITION  AL543805 Homo sapiens PLACENTA COT 25-NORMALIZED Homo sapiens cDNA
            clone CS0DI005YG20 5-PRIME, mRNA sequence.
ACCESSION   AL543805
VERSION     AL543805.2 GI:31265651
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 942)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     On Feb 15, 2001 this sequence version replaced gi:12876284.
            Contact: Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web: www.genoscope.cns.fr
            Library was constructed by Life Technologies, a division of
            Invitrogen. This sequence belongs to sequence cluster 7006.r For
            more information about this cluster, see
            http://www.genoscope.cns.fr/
            cgi-bin/cluster.cgi?seq=CS0DI005BD10QP1&cluster=7006.r. Contact:
            Feng Liang Email: fliang@lifetech.com URL:
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID: CS0DI005BD10QP1.
FEATURES             Location/Qualifiers
     source          1..942
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DI005YG20"
                     /tissue_type="PLACENTA COT 25-NORMALIZED"
                     /clone_lib="Homo sapiens PLACENTA COT 25-NORMALIZED"
                     /note="1st strand cDNA was primed with a NotI-oligo(dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."
BASE COUNT  318 a 176 c 155 g 284 t 9 others
ORIGIN



Query Match 19.8%; Score 851; DB 9; Length 942; 

Best Local Similarity 98.1%; Pred. No. 1.3e-110; 

Matches 876; Conservative 4; Mismatches 11; Indels 2; Gaps 2; 



Qy 1803 TTTACAGTTAGCACTTCAACATAGCTCTTAACAACT 1862 

II II M M INI I I II II II I II II I IN II I 

Db 51 TTCCCGGGATGCACTTCAACATAGCTCTTAAC^CTTCC^GGATATTCACAC^ACACTTA 110 

Qy 1863 GGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTAAA 1922 

M IT'I I IIUI I '[ I 'll'll 'I M iril' I I II' 

Db 111 GGCTTAAAAATGAGCTCACTCAGAATTTCTATTCTTTCTAAAAAGAGATTTATTTTTAAA 170 



Qy 1923 TCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATGAA 1982
IIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI
Db 171 TCAATGGGACTCTGATATAAAGGAAGAATAAGTCACTGTAAAACAGAACTTTTAAATGAA 230

Qy 1983 GCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAAC-AACTTTTCAATTAATATT 2041
I I II MM III ! MUIM III llllllllllllllllll
Db 231 GCTTAAATTACTCAATTTAAAATTTTAAAATCCTTTAAAACTAACTTTTCAATTAATATT 290

Qy 2042 ATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCATT 2101
1 1 1 1 M 1 1 1 1 1 1 MM 1 1 1 Ml I II 1 1 li I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1
Db 291 ATCACACTATTATCAGATTGTAATTAGATGCAAATGAGAGAGCAGTTTAGTTGTTGCATT 350

Qy 2102 TTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGTT 2161
IIIIIIIIIMIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIMII
Db 351 TTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAGAAAGAGCAAGGCTGTT 410

Qy 2162
:i imi mi i ii m i ,n :i ii :i im,i iiiiii i
Db 411

Qy 2222
I II II MMMI I IM II: MMM MM I IMM I
Db 471

Qy 2282 ATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGTTAAGATCAAACCTCAC 2341
Ml! MM M M M II MMM M I MM MMI IMM
Db 531 ATAATACTTTTTAAAAGARAATTATTACATCCTTTACATTCAGTTAAGATC^AACCTCAC 590

Qy 2342 AAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTTTGAATCTGTCATTCAC 2401
lllllllllllllllllllll MM Ml M II M MM IMM III |
Db 591 AAAGAGAAATAGAATGTTTGANAGGCTATCCCAAAAGACTTTTTTGAATCTGTCNTTCTC 650

Qy 2402 ATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAAATCTTCTTTTTT 2461
I Ml M II I II I IM II IMMIMI M IMM III
Db 651 ATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTATTAAMATCTTCTTCTTT 710

Qy 2462 CACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACAC 2521
MMI II II I IMI II MM MM Ml II II II I
Db 711 CACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAATACTTACCTACATACAC 770

Qy 2522 TGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGATG 2581
MIMIIMI IMM II MMI I II II Mill MMI I
Db 771 TGCATGTAGATGATTAAMTGAGGGCAGGCCCTGTGCTCATAGCTTTACGATGGAGAGATG 830

Qy 2582 CCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGTCCACATGACAAAGGGG 2641
IM M I Ml M M MM II MM MM M M Ml II M I
Db 831 CCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGC^GTGTCCACATGAC^W^GGGG 890

Qy 2642 CAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTCTAGCATATG 2694
IIIMIIMIIIIMIIMIIIIIIIMIIMI IIIMIIIIIIIIIIIIM
Db 891 CAGGTAGCACCCTCTCTCACCCATGCTGTGGTT-AAATGGTTTCTAGCATATG 942



RESULT 10 
BQ229233 

LOCUS       BQ229233 891 bp mRNA linear EST 02-MAY-2002
DEFINITION  AGENCOURT_7511051 NIH_MGC_72 Homo sapiens cDNA clone IMAGE:6055288
            5', mRNA sequence.
ACCESSION   BQ229233
VERSION     BQ229233.1 GI:20410633
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 891)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC/DCTD/DTP
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM13315 row: e column: 17
            High quality sequence stop: 696.
FEATURES             Location/Qualifiers
     source          1..891
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:6055288"
                     /tissue_type="melanotic melanoma"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_72"
                     /note="Organ: skin; Vector: pCMV-SPORT6; Site_1: NotI;
                     Site_2: SalI; Cloned unidirectionally. Primer: Oligo dT.
                     Average insert size 2 kb. Library constructed by Life
                     Technologies."
BASE COUNT  291 a 162 c 155 g 283 t
ORIGIN



Query Match 19.7%; Score 848; DB 13; Length 891; 

Best Local Similarity 99.1%; Pred. No. 3.6e-110; 

Matches 884; Conservative 0; Mismatches 5; Indels 3; Gaps 3; 
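
The figures in these hit summaries appear to be related: in each result of this listing, Best Local Similarity equals Matches divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). That relation is an inference from the numbers printed here, not a documented definition of the search program, and the helper names below (`parse_counts`, `best_local_similarity`) are hypothetical. A minimal Python sketch that re-derives the percentage from a summary line:

```python
import re


def parse_counts(line: str) -> dict[str, int]:
    """Extract 'Name value;' integer fields from a hit summary line."""
    return {name: int(value) for name, value in re.findall(r"([A-Za-z]+)\s+(\d+);", line)}


def best_local_similarity(counts: dict[str, int]) -> float:
    """Percentage of alignment columns that are exact matches.

    Assumption: total columns = Matches + Conservative + Mismatches + Indels.
    """
    columns = (counts["Matches"] + counts["Conservative"]
               + counts["Mismatches"] + counts["Indels"])
    return 100.0 * counts["Matches"] / columns


if __name__ == "__main__":
    summary = "Matches 884; Conservative 0; Mismatches 5; Indels 3; Gaps 3;"
    print(f"{best_local_similarity(parse_counts(summary)):.1f}%")
```

For the RESULT 10 line above this gives 884 / 892, which rounds to the reported 99.1%.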
Qy 2086 GTTTAGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAG 2145 

IIIIIIIIIIIIIIIIIIMIMIIMIIIIIIIIIMIMIIIIIIIIIIIIIIMIII 

Db 1 GTTTAGTTGTTGCATTTTTCGGACACTGGAAACATTTAAATGATCAGGAGGGAGTAACAG 60 

Qy 2146 AAAGAGCAAGGCTGTTTTTGAAAATCATTACACTTTCACTAGAAGCCCAAACCTCA 2205 

IIIMIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

Db 61 AAAGAGCAAGGCTGTTTTTGAAAATCATTACACTTTC^ 120 

Qy 2206 TCTGCAATATGTAACCAACATGTCACAAACAAGCA 2265 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIII 

Db 121 TCTGGAATATGTAACCAAGATGTCACAAACAA 180 

Qy 2266 CAGCTGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGT 2325 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 . 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 181 CAGCTGAATTTAAAATATAATACTTTTAAAAAGAAAATTATTACATCCTTTACATTCAGT 240 



Qy 2326 TAAGATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTT 2385 

IIIIIIIIIIIIIIIIMM III Ml III lllllllllllllll IIIMIMM 

Db 241 TAAGATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTATCCCAAAAGACTTTTT 300 

Qy 2386 TGAATCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTAT 2445 

IIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIMIIIIIII 

Db 301 TGAATCTGTCATTCACATACCCTGTGAAGACAATACTATCTACAATTTTTTCAGGATTAT 360 

Qy 2446 TAAAATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAAT 2505 

I II I I lllllllllllllllllllllllllllllllllllllllllllllll 

Db 361 TAAAATCTTCTTCTTTCACTATCGTAGCTTAAACTCTGTTTGGTTTTGTCATCTGTAAAT 420 

Qy 2506 ACTTACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCT 2565 

IIIIIIIIIIIIIIIIIIIIMIIMIMIIIIIIIIIMIIIIIMIIIIIIIIIIIM 

Db 421 ACTTACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGGCCCTGTGCTCATAGCT 480 

Qy 2566 TTACGATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGT 2625 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 481 TTACGATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAACTGCCTGGTGCAGTGT 540 

Qy 2626 CCACATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTC 2685 

. ■ I I M II I II IM I I I I = 

Db 541 CCACATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTGTGGTTAAAATGGTTTC 600 

Qy 2686 TAGCATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACA 2745 

Ml I 1 I II I MM I' MM M I il I I II I III Ml I 

Db 601 TAGCATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGATTAGTACA 660 

Qy 2746 TTTAACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAAT 2805 

M II I MM MM II I M II I I Mill I MM M I, I M 

Db 661 TTTAACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAAT 720 

Qy 2806 AGAAAAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGC-AAAACTGCTTTTTGAG 2864 

MMIM M MM II MM MMIMMIMI I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 721 AGAAAAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAAACTGCTTTTTGAG 780 

Qy 2865 ACCGTAAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAG 2924 

lllllllllllllllll IMIIIMMIIIIIIMII Mill lllllllllllllll 

Db 781 ACCGTAAGAACCTCTTACCTTTGTGCGTTCCTGCCTAA- TTTTAAATCTTCTAAGCAAAG 839 

Qy 2925 TGCCTTAGGATAGCTTGGG-ATGAGATGTGTGTGAAAGTATGTACAAGAGAA 2975 

II I II I MM lllllllllllllllll lllllllllll II 

Db 840 TGCCTTAGGATAGCTTGGGAATGAGATGTGTGTGAAAATATGTACAAGAAAA 891 



RESULT 11 

AL546465 

LOCUS       AL546465 1201 bp mRNA linear EST 31-MAY-2003
DEFINITION  AL546465 Homo sapiens PLACENTA COT 25-NORMALIZED Homo sapiens cDNA
            clone CS0DI030YM19 5-PRIME, mRNA sequence.
ACCESSION   AL546465
VERSION     AL546465.2 GI:31268299
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 1201)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     On Feb 15, 2001 this sequence version replaced gi:12879606.
            Contact: Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web: www.genoscope.cns.fr
            Library was constructed by Life Technologies, a division of
            Invitrogen. This sequence belongs to sequence cluster 7006.r For
            more information about this cluster, see
            http://www.genoscope.cns.fr/
            cgi-bin/cluster.cgi?seq=CS0DI030AG10QP1&cluster=7006.r. Contact:
            Feng Liang Email: fliang@lifetech.com URL:
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID: CS0DI030AG10QP1.
FEATURES             Location/Qualifiers
     source          1..1201
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DI030YM19"
                     /tissue_type="PLACENTA COT 25-NORMALIZED"
                     /clone_lib="Homo sapiens PLACENTA COT 25-NORMALIZED"
                     /note="1st strand cDNA was primed with a NotI-oligo(dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."
BASE COUNT  272 a 289 c 321 g 289 t 30 others
ORIGIN



Query Match 19.0%; Score 816.2; DB 9; Length 1201; 

Best Local Similarity 97.2%; Pred. No. 1e-105; 

Matches 854; Conservative 7; Mismatches 15; Indels 3; Gaps 3; 



Qy 178
II II IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIII
Db 219

Qy 238 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 297
IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIMIIIIIIIIIIIIIII
Db 279 ATGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGC 338

Qy 298 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 357
IIIIIIIMIIMMIIIMIMIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIMI
Db 339 CTGTCGCGGATCTGGGGAGAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTG 398

Qy 358
IIIIIIIIIIIMMMIIIIIIIIMIIIMMMIIIMMMIIIIIIMIIIIII
Db 399

Qy 418 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 477
IMMIIIIIMIIIMMIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIM
Db 459 AGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCT 518

Qy 478 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 537
IIIIMIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIII
Db 519 CCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTC 578

Qy 538 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 597
1 1 : . 1 1 1 1 ' I [ I i 1 1 I I !
Db 579 AAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCC 638

Qy 598 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 657
I! II II III I III I II IMIM III 'I M Ih Mi E; II
Db 639 ACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGTCCCAATATCTTGATC 698

Qy 658 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 717
I llllll III III II Mill II II! II h III M IIMII II
Db 699 GCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATCCCTATCAATGTCTAC 758

Qy 718 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 777
I, II II Ih i Ml I II IMI II M II ,1 Ih Mill
Db 759 AAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 818

Qy 778 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 837
IIIIIIIMIIIIIIIIIIIIIIIIIIMIIMIMIIIIIIIIIIIIIIIIIIIIIMI
Db 819 CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATAT 878

Qy 838 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTA 897
IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIMIII lllllllllllllllll
Db 879 CGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTT-CAAAATGGACAGCAGTA 937

Qy 898 GAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTT 957
h II h hi! hhh II hi II II h II II Ih II II
Db 938 GAAATTGTTTTKATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTVTT 997

Qy 958 GATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTT 1017
i mi i hi h h h iiiiiiiiiiiiiiiiiiiiiiii hi i
Db 998 GATATAATTACGATGGACTACAAAGG-AGTTATCTGCGAATCTGCTTGCTT-MWCCCSGT 1055

Qy 1018 CAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAA 1056
h II IIIIIIIIIIIIIMI || : | | | |
Db 1056 YARAAGAAAGCTTTCATGCAGTTTACAAAAMAGCAAAAA 1094



RESULT 12 

BI858627 

LOCUS       BI858627 972 bp mRNA linear EST 10-OCT-2001
DEFINITION  603389094F1 NIH_MGC_87 Homo sapiens cDNA clone IMAGE:5398054 5',
            mRNA sequence.
ACCESSION   BI858627
VERSION     BI858627.1 GI:15999374
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 972)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM12014 row: l column: 23
            High quality sequence stop: 899.
FEATURES             Location/Qualifiers
     source          1..972
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5398054"
                     /tissue_type="mammary adenocarcinoma, cell line"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_87"
                     /note="Organ: breast; Vector: pCMV-SPORT6; Site_1: NotI;
                     Site_2: SalI; Cloned unidirectionally; oligo-dT primed.
                     Average insert size 1.383 kb. Library enriched for
                     full-length clones and constructed by Life Technologies.
                     Note: this is a NIH_MGC Library."
BASE COUNT  230 a 241 c 244 g 256 t 1 others
ORIGIN



Query Match 18.8%; Score 808.6; DB 12; Length 972; 

Best Local Similarity 96.3%; Pred. No. 1.3e-104; 

Matches 903; Conservative 0; Mismatches 25; Indels 10; Gaps 7; 



Qy 321
M M I I II IM ! II I I !l I II Ml
Db 1

Qy 381
I III I IM II I M II MMI Ml M. I II I IMIM
Db 61

Qy 441 TGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACCATCTCCCCTCC 500
I MMM I III MM IMIMM II II I IM I IMIMM
Db 121 TGCGGAGGTGCCTAAAGGAGACAGGACGGCAGGATCTCCGCCACGCACCATCTCCCCTCC 180

Qy 501 CCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTC 560
MM MM I II II I II II III III MM III MINN llllllllll MINI MM II
Db 181 CCCGTGCCAAGGACCC^TCGAGATC^GGAGACTTTCAAATACATC^CACGGTTGTGTC 240

Qy 561 CTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAA 620
Mill I Ml III MM II I I Ml IM II II I IMIMM
Db 241 CTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAA 300

Qy 621 GAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCT 680
IIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIMIIIIIIIIIIIIIIIMIIIIIIII
Db 301 GAACAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCT 360

Qy 681 GCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC 740
1 1 1 Ml 1 1 1 1 1 1 1 II I II II MM 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M III II 1 1 1
Db 361 GCTGCACATCGTCATTGACATCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCC 420

Qy 741 ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCAC 800
II II II 1 1 II 1 1 II III II MM II I M 1 1 1 II Ml M 1 1 II 1 1 II I II II I M II M
Db 421 ATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCAC 480

Qy 801 TGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAG 860
1 1 1! 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E M 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1
Db 481 TGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAG 540

Qy 861 AATTAAAGGAA-TTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGG 919
M 1 1 1 : 1 1 1 1 1 II II II II II II II M I II II II II M M II I 1 1 1 1 1 1 1 ! I i 1 1 1 1
Db 541 AATTAAAGGAACTTGGGGTTCCAAAATGGACAGCAGTAGAAATCG-TTTGATTTGGGTGG 599

Qy 920 TCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGATGGACTACA 979
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIIIIMIMIIIIIMM
Db 600 TCTCTGTGGTTCTGGCTGTCCCTGAAGCCATAGG-TTTGATATAATTACGATGGACTACA 658

Qy 980 AAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGT 1039
1 1 1 1 r 1 1 1 1 1 1 1 1 ! I ) I It 1 1 ! M i 1 1 1 1 1 II 1 1 1 1 1 1 1 E 1 1 1 1 1 1 If M 1 1 1 1 1 1 i 1 1 1
Db 659 AAGGAAGTTATCTGCGAATCTGCTTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCAGT 718

Qy 1040 TTTACAAGACAGCAAAAGATTGGT-GGCTGTTCAGTTTCTATTTCTGCTTGCCATT-GGC 1097
I I I I M I I I I I I II I I II I I I I I I I I I I I I I I I II II I I I I I I I I II II I I I I II
Db 719 TTTACAAGACAGCAAAAGATTGGTGGGCTATTCAGTTTCTATTTCTGCTTGCCATNGGGG 778

Qy 1098 CATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCAT 1157
MMMMMMMMMMMMMIMMIMM llllllllll Ml llllll I
Db 779 CATCACTGCATTTTTTTATACACTAATGACCTGTGACATGTTGAGAACGAACAGTGGCTT 838

Qy 1158 GCAGATTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGTCTTTTG 1217
llllll II llllllllllllllllllllllll II I I II I I I I
Db 839 GCAGAT--GCTTTAATGATCACCTAAAGCAGAGACGGAA GTGGCAAAACCGTCTTTG 893

Qy 1218 CCTGGTCCTTGTCTTTGCCCTCTGCTGGCTTCCCCTTC 1255
Illlllll I I I I I I llllll I I I M I Mill | |
Db 894 CCTGGTCCCTGTCTTTGCCCTCTGCTGGGTTCCCTTAC 931



RESULT 13 

AL570142/C 

LOCUS       AL570142 942 bp mRNA linear EST 31-MAY-2003
DEFINITION  AL570142 Homo sapiens PLACENTA COT 25-NORMALIZED Homo sapiens cDNA
            clone CS0DI005YG20 3-PRIME, mRNA sequence.
ACCESSION   AL570142
VERSION     AL570142.2 GI:31291568
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 942)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     On Feb 16, 2001 this sequence version replaced gi:12926160.
            Contact: Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web: www.genoscope.cns.fr
            Library was constructed by Life Technologies, a division of
            Invitrogen. This sequence belongs to sequence cluster 7006.r For
            more information about this cluster, see
            http://www.genoscope.cns.fr/
            cgi-bin/cluster.cgi?seq=CS0DI005BD10NP1&cluster=7006.r. Contact:
            Feng Liang Email: fliang@lifetech.com URL:
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID: CS0DI005BD10NP1.
FEATURES             Location/Qualifiers
     source          1..942
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DI005YG20"
                     /tissue_type="PLACENTA COT 25-NORMALIZED"
                     /clone_lib="Homo sapiens PLACENTA COT 25-NORMALIZED"
                     /note="1st strand cDNA was primed with a NotI-oligo(dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."
BASE COUNT  284 a 135 c 152 g 315 t 56 others
ORIGIN



Query Match 18.7%; Score 802.4; DB 9; Length 942; 

Best Local Similarity 90.0%; Pred. No. 9.7e-104; 

Matches 848; Conservative 54; Mismatches 34; Indels 6; Gaps 5; 



Qy 2695 TATAATGCTATAGTTAAAATACTATTTTTCAAAATCATACAGA-TTAGTACATTTAACAG 2753
Ml II I III I! MM II 1 1 1 : II III II MM
Db 942 WAWAAKGCWATAGTTAAAWTRCTATTTTGCAAAATC^TR(^GATTTAGWA(^TTTAACAG 883

Qy 2754 CTACCTGTAAAGCTTATTACTAATTTTTGTATTATTTTTGTAAATAGCCAATAGAAAAGT 2813
MM IMIM MM MIMIMI Mill IIMMIMM
Db 882 CTRCCTGTAAAGCTWWTWACTAATTTTTGTATWATTTTTGTAAATAGCCAATAKAAAAK^ 823

Qy 2814 TTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGA 2873
llh|::|h:hl||||MMIIhllllll!lll||MIIIIIIIIIIMIIMIIII
Db 822 TTGYTKRACWKGKTGCTTTTCTTTCWTCTAGAGGCAAAACTGCTTTTTGAGACCGTAAGA 763

Qy 2874 ACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGG 2933
MMIIIIIIIIIIMIIIIIIIIhlllMIIIIMMIIIIIIIIIIMIIIIIIIII
Db 762 RCCTCTTAGCTTTGTGCGTTCCTGCSTAATTTTTATATCTTCTAAGCAAAGTGCCTTAGG 703

Qy 2934 ATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAA 2993
IIIIIIIIIIIIIIIIIIMMIIIIMIMMMIIIIIIIIIIIIMIIIIIIIIIII
Db 702 ATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACAAGAGAAAACGGAAGAGAGAGGAAA 643

Qy 2994 TGAGGTGGGGTTGGAGGAAACCCATGGGG-ACAGATTCCCATTCTTAGCCTAACGTTCGT 3052
I II II II I II II II III II II I! I II II I MINIM 1 1 Mil I I MM II II I Ml
Db 642 TGAGGTGGGGTTGGAGGAAACCCATGGGGAACAGATTCCCATTCTAAGCCTAACGTACGT 583

Qy 3053 CATTGCCTCGTCACATCAATGCAAAAGGTCCTGAT^ 3112
Ill Mill! 11:11: II I IIUI II II I MhM Mi I I
Db 582 CATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTTTGTTCCAGCAAAACACAGTGCAA 523

Qy 3113 TGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAAT 3172 

M IMM Ml M ll,M ll,il Ml 11 ,1 III llMI I | 

Db 522 TGTTCTCAGAGTGACTTTCGAAATAAATTGGGCCCAAGAGCTTTAACTCGGTCTTAAAAT 463 

Qy 3173 ATGCCCAAATTTTTAC--TTTGTTTTTCTTTTAATAGGCTGGGCCACATGTTGGAAATAA 3230 

llllllllhll : I I II III I | : - f I ! ! J ■ ! 1 1 I I 1 I I I III 

Db 462 ATGCCCAAAWTTAWACATTTGATTTATCWMTAAAWAGGCTGGGCCACATGTTGGAAATAA 403 

Qy 3231 GCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACAGTAAACCAAAACCCAAC 3290 

i r ii i Mi iniMii Mill ill Mi i iru in i -I 

Db 402 GCTAGTAATGTTGTTTTCTGTCAATATTGAATGTGATGGTACARTAAACCAAAACCCAAC 343 

Qy 3291 AATGTGGCCAGAAAGAAAGAGCAATAATAATTAATTCACACACCATATGGATTCTATTTA 3350 

Mill : MM HI || MhM MUM IMM HI I 

Db 342 AATGTAAMAARAAARAAAGAAMAAAAAAAATWAAATMAC^ 283 

Qy 3351 TAAATC^CCCACAAACTTGTTCTTTAATTTCATCCCAATCACTTTTTCAGAGGCCTGTTA 3410 

I III! MIMIII M II! : III I IIIMIM II MM Ml II: 1 1 1 1 Mill I 

Db 282 TAAATCACCVACAAACTTAWCCTTVAATTTCATSCCAATCASTTTBACAGAAGCCTGTAA 223 

Qy 3411 TCATAGAAGTCATTTTAGACTCTCAATTTTAAATTAATTTTGAATCACTAATATTTTCAC 3470 

M II - II MM I II Ml MUM I I I 1 1 1 ! I ! II 

Db 222 TCATAGAAGBSATTTTAGACTSTSAATTTTAAATBAATTYTGAATCCCTAATATTTTCAC 163 

Qy 3471 AGTTTATTAATATATTTAATTTCTATTTAAATTTTAGATTATTT-TTATTACCATGTACT 3529 

M I ll I II. I M I Mi Mill I I ; I .ll.lll II 

Db 162 AGTTTATTAATATATTTAATYTCTAWTAAAATTTTAGATTATTTATTATTACCATGTACT 103 

Qy 3530 GAATTTTTACATCCTGATACCCTTTCCTTCTCCATG-TCAGTATCATGTTCTCTAATTAT 3588 

lllllllllllllhlllll III Ml I Ml II I I IMIIIIIIIM 

Db 102 GAATTTTTACATCCWGATACTCTTACCTTCWCCATGATCAGTATCATATTCTCTAATTAT 43 

Qy 3589 CTTGCCAAATTTTGAAACTACACACAAAAAGCATACTTGCAT 3630 

III Illllll MINI MM llllll Mill Ml 

Db 42 CTTCCCAAATTTTGAAACAACACACAAAAAGCATACTGAMAT 1 



RESULT 14 

BU557315 

LOCUS       BU557315 884 bp mRNA linear EST 16-SEP-2002
DEFINITION  AGENCOURT_10253486 NIH_MGC_109 Homo sapiens cDNA clone
            IMAGE:6585168 5', mRNA sequence.
ACCESSION   BU557315
VERSION     BU557315.1 GI:22907611
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 884)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLCM2796 row: k column: 24
            High quality sequence stop: 649.
FEATURES             Location/Qualifiers
     source          1..884
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:6585168"
                     /tissue_type="teratocarcinoma, cell line"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_109"
                     /note="Organ: ovary; Vector: pOTB7; Site_1: EcoRI; Site_2:
                     XhoI; cDNA made by oligo-dT priming. Directionally cloned
                     into EcoRI/XhoI sites using the following 5' adaptor:
                     GGCACGAG(G). Library constructed by Ling Hong in the
                     laboratory of Gerald M. Rubin (University of California,
                     Berkeley) using ZAP-cDNA synthesis kit (Stratagene) and
                     Superscript II RT (Life Technologies). Note: this is a
                     NIH_MGC Library."
BASE COUNT  221 a 212 c 218 g 233 t
ORIGIN

Query Match 18.5%; Score 794.8; DB 13; Length 884; 

Best Local Similarity 98.1%; Pred. No. 1.2e-102; 

Matches 836; Conservative 0; Mismatches 12; Indels 4; Gaps 3; 
Qy 343 GCCACTCCGCTTTTGCAAACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCC 402 

mi ii mm I :i ii ;i i inn in mil h irn i 

Db 1 GCCACTCCGCTTTTGC-AACCGCAGAGATAATGACGCCACCCACTAAGACCTTATGGCCC 59 

Qy 403 AAGGGTTCCAACGCCAGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGAC 462 

MUM I MMM IMIIMI MIM III I III Mill III MM 

Db 60 AAGGGTTCCAACGCCAGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGAC 119 

Qy 463 AGGACGGCAGGATCTCCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAG 522 

IIMIIMIMMIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIMI 

Db 120 AGGACGGCAGGATCTCCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAG 179 

Qy 523 ATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGG 582 

IIIMI I Ml M IMM IM I Ml M I MMM I MMM 

Db 180 ATCAAGGAGACTTTCAAATACATCAACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGG 239 

Qy 583 ATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAACAAGTGCATGCGAAACGGT 642 

IMIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 240 ATC^TCGGGAACTCCACACTTCTGAGAATTATCTACAAGAACAAGTGC^TGCGAAACGGT 299 

Qy 643 CCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATC 702 

IMM I II IMIMI II M MMM IMMMMMI IM 

Db 300 CCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACATC 359 



Qy 703 CCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAG 762
i;i I ! ' I II !: I i I I. I '
Db 360 CCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAG 419

Qy 763 CTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTG 822
' I I I i . I ! ' I I I I I I : II
Db 420 CTGGTGCCTTTCATACAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTG 479

Qy 823 AGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCA 882
mi mi urn n n mm m n imimi mm i n
Db 480 AGTATTGACAGATATCGAGCTGTTGCTTCTTGGAGTAGAATTAAAGGAATTGGGGTTCCA 539

Qy 883 AAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCT 942
M MM IIMI II IMI III I Ml I MM Ml I II
Db 540 AAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTCTGGCTGTCCCT 599

Qy 943 GAAGCCATAGGTTTTGATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGC 1002
I'll M M IMIMIMI IMI I II IMIMMII MM
Db 600 GAAGCCATAGGTTTTGATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGC 659

Qy 1003
II MM MM MMI MM MM I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 660 TTGCTTCATCCCGTTCAGAAGACAGCTTTCATGCA 719

Qy 1063 TGGCTGTTCAGTTTCTATTTCTGCTTGCCATTGGCCATCACTGCATTTTTTTATACACTA 1122
Db 720 TCGCTGTTCAGTTTCT 779

Qy 1123 ATGACCTGTGAAATGTT-GAGAAAGAAAAGTGGCATGCAGATTG--CTTTAAATGATCAC 1179
lllllllllllllllll Mill i I i I, II llllllllllllll
Db 780

Qy 1180
Db 840


RESULT 15 

BQ719386 

LOCUS       BQ719386 911 bp mRNA linear EST 16-JUL-2002
DEFINITION  AGENCOURT_8469884 Lupski_sympathetic_trunk Homo sapiens cDNA clone
            IMAGE:6195994 5', mRNA sequence.
ACCESSION   BQ719386
VERSION     BQ719386.1 GI:21858283
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 911)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. James R. Lupski
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM13603 row: l column: 11
            High quality sequence stop: 606.
FEATURES             Location/Qualifiers
     source          1..911
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:6195994"
                     /sex="male"
                     /tissue_type="sympathetic trunk"
                     /dev_stage="adult, 16 yr"
                     /lab_host="DH10B"
                     /clone_lib="Lupski_sympathetic_trunk"
                     /note="Vector: pCMV-SPORT6 (Life Technologies); Site_1:
                     NotI; Site_2: SalI; cDNA made by oligo-dT priming.
                     Directionally cloned using the following adaptors:
                     5'-TCGACCCACGCGTCCG-3' and
                     5'-GACTAGTTCTAGATCGCGAGCGGCCGCCCT(15)-3'. Size selected >
                     1 kb for average insert length 1.9 kb. This is a primary
                     library, non-amplified. Library constructed by Life
                     Technologies and donated by J. Lupski, M.D./Ph.D. (Baylor
                     College of Medicine); available through Life
                     Technologies."
BASE COUNT  273 a 181 c 167 g 289 t 1 others
ORIGIN



  Query Match             18.4%;  Score 792;  DB 13;  Length 911;
  Best Local Similarity   95.6%;  Pred. No. 2.9e-102;
  Matches  859;  Conservative    0;  Mismatches   31;  Indels    9;  Gaps    4;
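The Best Local Similarity figure for this hit appears to be the number of matches divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels). That reading is inferred from the counts listed for RESULT 15; the report itself does not state the formula. A minimal Python sketch under that assumption, with a function name made up for illustration:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of alignment columns that are exact matches.

    Assumption (not stated in the report): the denominator is the sum of
    all four column counts printed in the result summary.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts taken from the RESULT 15 summary lines.
pct = best_local_similarity(matches=859, conservative=0, mismatches=31, indels=9)
print(f"{pct:.1f}%")
```

With the counts printed for this hit (859 matches over 899 columns) the sketch gives 95.6%, which agrees with the reported value.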



Qy       2310 ATCCTTTACATTCAGTTAAGATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTA 2369
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 ATCCTTTACATTCAGTTAAGATCAAACCTCACAAAGAGAAATAGAATGTTTGAAAGGCTA 60



Qy       2370 TCCOW^GACTTTTTTGAATCTGTCATTCACATACCCTGTGAAGACAATACTATCTAC^ 2429

6i i''liLL^ 



Qy       2430 ATTTTTTCAGGATTATTAAAATCTTCTTTTTTCACTATCGTAGCTTAAACTCTGTTTGGT 2489
              |||||||||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db        121 ATTTTTTCAGGATTATTAAAATCTTCTTCTTTCACTATCGTAGCTTAAACTCTGTTTGGT 180

Qy       2490 TTTGTCATCTGTAAATACTTACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGG 2549
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 TTTGTCATCTGTAAATACTTACCTACATACACTGCATGTAGATGATTAAATGAGGGCAGG 240

Qy       2550 CCCTGTGCTCATAGCTTTACGATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAA 2609
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 CCCTGTGCTCATAGCTTTACGATGGAGAGATGCCAGTGACCTCATAATAAAGACTGTGAA 300

Qy       2610 CTGCCTGGTGCAGTGTCCACATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTG 2669
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 CTGCCTGGTGCAGTGTCCACATGACAAAGGGGCAGGTAGCACCCTCTCTCACCCATGCTG 360


Qy       2670 TGGTTAAAATGGTTTCTAGCATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAAT 2729
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 TGGTTAAAATGGTTTCTAGCATATGTATAATGCTATAGTTAAAATACTATTTTTCAAAAT 420


Qy       2730 CATACAGATTAGTACATTTAACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATT 2789
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CATACAGATTAGTACATTTAACAGCTACCTGTAAAGCTTATTACTAATTTTTGTATTATT 480


Qy       2790 TTTGTAAATAGCCAATAGAAAAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCA 2849
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 TTTGTAAATAGCCAATAGAAAAGTTTGCTTGACATGGTGCTTTTCTTTCATCTAGAGGCA 540


Qy       2850 AAACTGCTTTTTGAGACCGTAAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATTTTTAT 2909
              |||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||
Db        541 AAACTGCTTTTTGAGACCGTAAGAACCTCTTAGCTTTGTGCGTTCCTGCCTAATATTTAT 600


Qy       2910 ATCTTCTAAGCAAAGTGCCTTAGGATAGCTTGGGATGAGATGTGTGTGAAAGTATGTACA 2969
              |||||||||||||||||||||| ||||||||| |||||||||||||||||||||||||||
Db        601 ATCTTCTAAGCAAAGTGCCTTANGATAGCTTGCGATGAGATGTGTGTGAAAGTATGTACA 660


Qy       2970 AGAGAAAACGGAAGAGAGAGGAAATGAGGTGGGGTTGGAGGAAACCCATGGGGACAGATT 3029
              ||||||||||||||||||||||||||| |||||||||||||| |||||||||||||||||
Db        661 AGAGAAAACGGAAGAGAGAGGAAATGACGTGGGGTTGGAGGACACCCATGGGGACAGATT 720


Qy       3030 CCCATTCTTAGCCTAACGTTCGTCATTGCCTCGTCACATCAATGCAAAAGGTCCTGATTT 3089
              ||||||||||||||||||||||||||||||||   |||||||||  ||| | ||||||||
Db        721 CCCATTCTTAGCCTAACGTTCGTCATTGCCTCCGTACATCAATGGCAAAAGCCCTGATTT 780


Qy       3090 TGTTCCAGCAAAACACAGTGCAAT-GTTCTCAGAGTGA-CTTTCGAAATAAATTGGGCCC 3147
              |||||| |||||   ||||||||| ||||||||||||| ||||| |||||||||||||||
Db        781 TGTTCCCGCAAACACCAGTGCAATGGTTCTCAGAGTGACCTTTCAAAATAAATTGGGCCC 840


Qy       3148 AAGAG CTTTAA CTCGGTCTTAAAATATG CCCAAATTTTTACTTTGTTTTTCT 3199
Db        841 CAGAAGCTTTAACCTCCAGCCCTTAAAATATGGACCCCAAATTTTTACTTTTGGTTTTT 899



Search completed: December 12, 2003, 23:52:09 
Job time : 7779.55 secs



